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A  PERSPECTIVE  QN  MULTIACCESS  CHANNELS 

I.  INTRODUCTION 

This  paper  is  an  expanded  version  of  the  Shannon  lecture  at 
the  Internat ional  Symposium  on  Information  Theory  at  St.  Jovite, 
Quebec ,  in  September ,  1983.  For  the  last  ten  years  there  have 
been  at  least  three  bodies  of  research  on  multiaccess  channel s, 
each  proceeding  in  virtual  isolation  from  the  others  and  each 
using  totally  different  models.  The  objective  here  is  to 
contrast  these  bodies  of  work  and  to  give  some  perspective  on 
what  is  needed  to  provide  some  unification  between  the  areas.  We 
shall  refer  to  the  three  areas  as  collision  resolution* 
multiaccess  information  theory*  and  spread  spectrum. 

The  kind  of  communication  situation  that  these  three  areas 
address  is  illustrated  in  fig.  1.1.  There  are  multiple 
transmitters  and  a  single  receiver.  The  received  signal  is 
corrupted  both  by  noise  and  by  mutual  interference  between  the 
transmitters.  Each  of  the  transmitters  is  fed  by  an  information 
source*  and  each  information  source  generates  a  sequence  of 
messages*  successive  messages  arriving  at  random  instants  of 
time.  There  is  usually  some  small  amount  of  feedback  from  the 
receiver  to  the  tranmitter*  but  this  feedback  will  not  be  our 
main  focus.  Our  major  focus*  rather,  is  on  the  interference,  the 
noise,  and  the  random,  or  "bursty",  message  arrivals. 

This  type  of  model  is  appropriate  for  the  up  link  of  a 
satellite  network,  for  a  radio  network  where  there  is  one  central 


repeater*  and  for  the  traffic  to  the  central  node  on  a  multidrop 


telephone  line.  It  is  also  adequate  in  most  respects  for 
studying  networks  where  a  common  channel  allows  all  nodes  to  hear 
all  other  nodes.  Common  examples  are  a  cable  connecting  many 
nodes  and  a  fully  connected  radio  network. 

The  beginning  of  the  collision  resolution  approach  to 
multiaccess  communicat ion  came  in  1973  with  Abramson's  Aloha 
network  tlD.  The  idea  here  was  that  wherever  a  message  (or 
packet >  arrived  at  a  transmitter,  it  would  simply  be  transmitted* 
ignoring  all  other  transmitters  in  the  network.  If  another 
transmitter  was  transmitting  in  an  overlapping  interval, 
interference  would  prevent  the  message  from  being  correctly 
received,  the  cyclic  redundancy  check  <CRC>  would  not  check,  no 
acknowledgement  would  be  sent,  and  the  transmitter  would  try 
again  later;  the  later  time  would  be  pseudorandom ly  chosen  to 
avoid  the  certainty  of  another  collision  if  both  transmitters 
waited  the  same  time. 

Over  the  years,  this  basic  strategy  has  been  improved, 
generalized,  and  analyzed  in  many  ways.  A  number  of  variations 
are  in  widespread  use,  and  the  general  topic  of  collision 
resolution  has  provided  many  challenging  and  interesting  problems 
for  research.  Section  4  provides  an  introduction  to  these 
problems  and  most  of  the  other  papers  in  this  special  issue  are 
devoted  to  the  current  state  of  these  problems. 

Collision  resolution  research  has  always  focused  on  the 
bursty  arrivals  of  messages  and  the  interference  between 
transmitters,  but  has  generally  ignored  the  noise.  More 
generally,  this  approach  ignores  the  underlying  communication 


process*  assuming  only  that  a  message  transmission  is  correctly 
received  in  the  absence  of  collision  and  incorrectly  received 
otherwise. 

The  multiaccess  information  theoretic  approach  to 
multiaccess  also  started  in  1973  with  a  coding  theorem  developed 
by  Ahlswede  £23  and  Liao  £33.  This  work  has  also  been 
generalized  in  many  ways  and  has  opened  up  a  separate  area  of 
research  problems.  Excellent  summaries  and  descriptions  of  this 
research  are  given  in  £4*5*61.  In  this  approach*  the  noise  and 
interference  aspects  of  the  multiaccess  channel  are  appropriately 
modelled*  but  the  random  arrivals  of  the  messages  are  ignored. 

Before  proceeding,  it  is  important  to  understand  why 
information  theorists  and  communication  system  designers  have 
always  essentially  ignored  random  message  arrivals  for  point  to 
point  channels*  and  why  this  is  usually  unreasonable  for 
multiaccess  channels.  For  a  point  to  point  channel,  one  normally 
assumes  an  infinite  reservoir  of  data  to  be  transmitted.  The 
reason  for  this  is  that  it  is  a  minor  practical  detail  to  inform 
the  receiver  when  there  is  no  data  to  send;  furthermore  there  is 
no  other  use  for  the  channel*  so  potential  lack  of  data  might  as 
well  be  left  out  of  the  model.  For  multiaccess  channels,  on  the 
other  hand*  most  transmitters  have  nothing  to  send  most  of  the 
time*  and  only  a  few  are  busy.  The  problem  is  then  to  share  the 
channel  between  the  busy  users,  and  this  is  often  the  central 
technical  problem  in  multiaccess  communication. 

A  pure  theoretician  would  properly  point  out  here  that 
bursty  message  arrivals  have  nothing  to  do  with  coding  theorems 
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for  multiaccess  channels.  The  arrivals  have  to  do  with  the 
sources  and  can  and  should  be  dealt  with  through  source  coding. 
Even  without  source  coding*  if  the  arrival  process  is  ergodic, 
then  over  the  arbitrarily  long  time  intervals  used  in  the  coding 
theorems,  the  bursty  arrivals  will  not  matter. 

From  a  more  practical  point  of  view,  the  limit  theorems  of 
information  theory  are  interesting  both  because  they  put  an  upper 
limit  on  what  is  achievable  and  because  the  limit  is  usually  not 
too  far  from  what  is  practically  achievable.  For  a  multiaccess 
channe 1 ,  however ,  t he  1 ong  time  i nt erva 1 s  r equ i red  for  t he  source 
arrivals  to  appear  smoothed  out  are  typically  far  greater  than 
the  tolerable  delays.  Conversely,  the  time  interval  required  for 
cod i ng  to  be  e f feet i ve  < i e .  t he  time  for  t he  no i se  to  be  smoot hed 
out >  is  typically  smaller  than  the  tolerable  delay.  What  is 
needed  then  is  an  information  theoretic  model  that  somehow 
precludes  the  possibility  of  imposing  long  delays  on  source 
messages. 

One  approach  to  this,  which  is  used  in  the  collision 
resolution  field,  is  to  assume  an  infinite  number  of  sources,  or 
equivalently,  that  a  new  transmitter  is  created  for  each  new 
arriving  message  and  then  destroyed  when  the  message  is 
successfully  transmitted.  The  received  sequence  or  waveform  would 
then  be  some  function  of  noise  and  whatever  was  being  transmitted 
by  the  active  transmitters.  It  seems  that  to  develop 
understanding  in  this  area,  it  is  necessary  first  to  develop  some 
understanding  of  coding  <as  opposed  to  coding  theorems)  in  a 
multiaccess  environment.  This  understanding  should  involve 
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decoding  in  the  presence  of  several  messages  being  transmitted 
simultaneously*  since  otherwise  the  problem  simply  reduces  to 
conflict  resolution  with  coding  added  for  reliable  transmission 
in  the  absence  of  conflicts. 

In  section  2*  we  discuss  multiaccess  information  theory  in 
more  detail*  and  in  section  3*  we  discuss  what  little  is  known 
about  coding.  In  both  sections*  the  discussion  is  restricted  to 
systems  with  only  two  sources.  It  appears  to  be  important  to 
understand  coding  in  this  simplest  context  before  tackling  the 
problem  of  real  interest  with  many  sources  and  transmitters. 

The  spread  spectrum  approach  to  multiaccess  channels  C  7  *  8  3 
will  not  be  discussed  in  any  detail  in  this  paper*  but  is  briefly 
discussed  here  in  order  to  illustrate  the  types  of  possibilities 
for  multiaccess  communication  that  lie  outside  the  conventional 
collision  resolution  and  coding  theory  approaches.  Spread 
spectrum  is  a  mode  of  communication  originally  developed  to 
protect  against  jamming  in  a  military  environment.  The  signal  to 
be  transmitted  is  modulated  over  a  much  broader  frequency  band* 
say  P  times  more  than  necessary.  Assuming  that  the  jammer  does 
not  know  the  modulating  sequence*  the  jammer's  signal  will 
essentially  look  like  broad  band  noise  to  the  signal*  and  the 
noise  seen  by  the  receiver  after  demodulation  will  be  reduced  by 
a  factor  of  p. 

For  multiaccess  communication  using  spread  spectrum*  several 
sources  can  transmit  at  once  using  different  modulating 
sequences*  and  each  will  look  like  broad  band  noise  to  the 


others.  I f  we  compare  this  type  of  system  to  frequency 


multiplexing?  using  p  frequency  bands?  it  appears  at  first  that 
spread  spectrum  is  not  a  very  good  idea.  When  a  number  of 
transmitters  approaching  p  transmit  together  using  spread 
spectrum?  the  self  noise  becomes  considerable?  and  the  resulting 
system  is  clearly  inferior  to  FDM  in  terms  of  capacity.  The 
problem  with  FDM?  however?  is  that  if  there  are  many  more  than  p 
transmitters  in  the  system?  but  typically  many  fewer  than  p  with 
messages  to  send?  there  is  a  problem  allocating  the  frequencies 
to  the  busy  transmitters  <this  is  the  same  fundamental  problem 
handled  by  the  collision  resolution  approach).  Since  many  times 
more  than  p  modulation  sequences  can  be  chosen  that  are  almost 
orthogonal  and  look  like  noise  to  each  other?  spread  spectrum 
provides  an  automatic  solution  to  the  problem  of  allocating  the 
channel  to  the  busy  users.  This  solution  is  not  entirely 
satisfactory?  since  one  still  needs  collision  resolution  when  too 
many  transmitters  send  at  once?  and  the  decoding  is  very  complex. 
It  illustrates?  however?  a  major  point  of  this  paper  -  namely 
that  a  more  fundamental  approach  and  set  of  models  are  needed  for 
multiaccess  communication  than  the  collision  resolution  or 
information  theoretic  approaches  alone. 


.<3c?r  ?  r 
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I I .  The  Information  Theoretic  Approach 

The  coding  theorems  of  information  theory  treat  the  question 
of  how  much  data  can  be  reliably  communicated  from  one  point*  or 
set  of  points*  to  another  point*  or  set  of  points.  It  is  tacitly 
assumed  that  the  sources  haoe  a  newer  emptying  reservoir  of  data 
to  send.  Thus  the  theoretical  results  in  this  area  do  not 
address  the  question  of  the  delay  that  arises  in  multiple  access 
systems  because  of  the  random  arrival  times  of  data  to  be 
transmitted. 

The  class  of  channels  to  be  considered  is  illustrated  in 
Fig.  2.1.  Each  unit  of  time*  the  first  transmitter  sends  a 
symbol  x  from  an  alphabet  X  and  the  second  transmitter  sends  a 
symbol  w  from  an  alphabet  Ul.  There  is  an  output  alphabet  Y  and  a 
transmitter  probability  assignment  P<y|xw>  determining  the 
probability  of  receiving  each  ycY  for  each  choice  of  inputs  xsX* 
and  wcUf.  The  channel  is  memory  less  in  the  sense  that  if  x  = 
<xj*...*x^>  and  t*  *  (uj*...*u^>  represent  the  inputs  to 
transmitters  one  and  two  respectively  over  N  successive  time 
units*  then  the  probability  of  receiving  =  <y1*...*yN>  for  the 
given  x*w*  is 

N 

P<»|  mm>  «  IT  P<yjx  w  >  <2. 1  > 

'  «  n 1  n  n 

n«l 

We  assume  for  the  time  being  that  the  alphabets  are  all  discrete* 
but  it  will  soon  be  obvious  that  this  can  be  generalized  in  the 
same  way  as  for  single  input  channels. 


As  indicated  in  the  figure*  there  are  two  independent 


sources  which  are  encoded  independent ly  into  the  two  channel 
inputs.  Consider  block  coding  with  a  gi wen  block  length  N  and 
with  M  code  words*  <Xj » Xg*  •  ■  •  * Xj*|>»  ^c,r  ^r^smitter  1  and  L  code 
words  *  . .  .  * *n_>  for  transmitter  2;  each  code  word  is  a  sequence 
of  N  channel  inputs.  For  convenience  we  refer  to  a  code  with 
these  parameters  as  an  <N,M,L>  code.  The  rates  of  the  two 
sources  are  defined  as 

Rj  -  <  In  M>/N,  R2  -  <  In  L>/N  <2.2> 

Each  M  units  of  time*  source  1  generates  an  integer  m  uniformly 
distributed  from  1  to  M  and  source  2  independently  generates  an 
integer  H  uniformly  distributed  from  1  to  L.  The  transmitters 
send  X,,,  and  respect i vely,  and  the  corresponding  channel  output 

V  enters  the  decoder  and  is  mapped  into  a  decoded  "message''  m»£. 
If  both  m  *  m  and  A  «  4»  the  decoding  is  correct  and  otherwise  a 
decoding  error  occurs.  The  probability  of  decoding  error*  Pe  is 
minimized  for  each  v  by  a  maximum  likelihood  decoder*  choosing 
<m,A>  as  integers  IS  m'  £  II*  1  S  *.*  S  L  that  maximize 
P<tf|x^« •«£/  >.  If  the  maximum  is  non-unique*  any  maximizing 
<m'»*'>  can  be  chosen  with  no  effect  on  P^.  Both  sets  of  code 
words  <X| * . . .  * x^>  and  -Cwj * . . .  * are  known  to  the  decoder*  but* 
of  course*  the  source  outputs  m,A  are  unknown. 

The  most  fundamental  result  about  these  channels  is  the 
coding  theorem  due  to  Ahlswede  C2D  and  Liao  C33.  Let  QjCx)  and 
Q2<w)  be  probability  assignments  on  the  X  and  Ul  input  alphabets 
respect ively.  Define  the  achievable  rate  region  R  as  the  convex 
hull  of  the  set  of  rate  pairs  <Rj»R2>  which*  for  some  choice  of 


* 
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» 

‘  assignments  satisfy  each  of  the  inequalities: 


'  P  <  y  j xw > 

R  +  R._,  S'TCXWS Y>  =  ■>  Q  <x>Q  <w>P<y|xw>ln  - 

1  *■  x,w,y  P<y>  <2.3> 


__  P  <  y | xw  > 

0  S  R  S  I<X;Y|UI>  *  >  Q  <x>Q<?<w>P<y|xw>ln  - 

1  x*  w* y  P < y | w >  <2.4> 


P  <  y  |  xw  '• 

0  <  R_  S  I<W*Y|X>  *  "5  Q.<x>Q  <w>P<y|xw>ln  - 

x  *  w  *  y  P  <  y | x  >  <  2 . 5  > 

where  P<y>  *  Zxw  Qi <x>Q2<w)P<y |xw>,  P<y|w>  »  Z;<  Qj <x)P<y | xw> ,  and 
P<y|x>  -  Zw  Q2<w>P<y|xw>. 

The  region  bounded  by  <2.3>-<2.5>.  for  a  given  Qj*Q2  is  shown 
in  fig.  2.2.  It  is  easy  to  see  that  the  break  points  of  the 
boundary  occur  at  Rj  ■  I<X*Y|W>»  R2  ■  I<W|Y>  and  at  Rj  *  I<X;Y>* 
R2  -  I<W*Y|X>.  In  general  I<X*Y|W>  >  I<XjY>  with  equality  iff  x 
and  w  are  conditionally  independent  given  y. 

Theorem  2.1  < Ahlswede* Liao) »  For  each  e  >  0*  6  >  0*  <R1»R2>«R» 
there  exists  an  N0  such  that  for  all  N  >  NQ*  M  <  exp  I'KRj-^j  L  £ 
exp  N<R2-$>»  there  exists  an  <N»M,L>  code  with  Pe  S  e.  For  each 
6  >  0  and  <Rj»R2>€R*  there  exists  s  >  0  such  that  P^,  2  s  for  all 
<N»M»L>  codes  with  M  2  exp  N<Rj+&>*  L  2  exp  N<R2+&>. 

In  effect*  the  theorem  says  that  reliable  communicat ion  is 


possible  for  source  rates  in  the  interior  of  the  achievable 


region  arid  is  impossible  outside  of  the  achievable  region. 

Slepian  and  Wolf  C93  later  generalized  this  result  by  considering 
a  third  source  that  could  be  encoded  jointly  for  both 
transmitters.  They  also  used  a  random  coding  argument  which 
showed  both  that  Pe  can  be  made  to  decrease  exponentially  with  N 
and  also,  in  a  sense,  that  most  codes  have  this  behavior.  Since 
this  random  coding  argument  is  a  very  simple  extension  of  random 
coding  for  single  input  channels  and  it  gives  a  great  deal  of 
insight  into  coding  for  multiple  access  channels,  we  now  go 
through  the  argument  for  the  two  source  case. 

Let  Qj<x>  and  G>2<w:>  be  probability  assignments  on  the  X  and 
W  alphabets  respectively  and  consider  an  ensemble  of  <N,M,L> 
codes  where  each  code  word  1  <  m  £  M  is  independent  ly 

selected  according  to  the  probability  assignment 

N 

Qj<x>  =  IT  Q1<xn>,  x  =  <Xj,x2 . xN>  < 2 . 6 > 

n=»  1 

and  each  code  word  v^,  1  <  <  <  L  is  i ndependent 1 y  se 1 eet ed 
accord i ng  to 

H 

■  TF  Q^w  w  =  <Wj,...,w^>  <2.7> 

n=  1 

For  each  code  in  the  ensemble,  the  decoder  uses  maximum 
likelihood  decoding,  and  we  want  to  upper-  bound  the  expected 
value  P#  of  P#  for  this  ensemble.  Define  an  error  event  to  be  of 
type  1  if  the  decoded  pair  <m,£>  and  the  original  source  pair 
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i 

<m,A>  satisfy  m '  s*  m,  A  =  .0. .  Ari  error  event  is  type  2  if  m  =  m 
and  A  *  and  is  of  type  3  i  f  m  ;*  m  and  A  i*  t.  Let  Pe^,  i  <  i  t 
3*  be  the  probability!,  over  the  ensemble!,  of  a  type  i  error 

e*  •  * 

event ;  ob  v i ous 1 y  Pe  =  pel  +  pe2  +  pe3‘ 

Consider  P^3  first.  Note  that  when  <m,£>  enters  the 
encoder,  there  are  M-l  choices  for  m  and  <L-1>  choices  for  A,  or 
<M— 1 > CL— 1 >  pairs,  that  yield  a  type  3  error.  For  each  such  pair 
<m,A>,  the  code  word  pair  x^,  *i£  is  statistically  independent  of 
x,^,  over  the  ensemble  of  codes.  Thus,  regarding  <x,  v>  as  a 
combined  input  to  a  single  input  channel  with  input  alphabet  XxW, 
we  can  directly  apply  the  coding  theorem,  theorem  5.6.1  of  C103, 
which  asserts*  that  for  all  p,  0  S  p  S  1, 

P#3  <  C<M-1XL-1>]P  ]>  £>  Q j  <x>Q2<*#>P<%#| xh)  <2.8 

y  x.  u 


’"The  statement  of  thoerem  5.6.1  of  C103  assumes  that  all  code 
words  are  chosen  independently,  but  the  proof  only  uses  pairwise 
independence  between  the  transmitted  word  <<xm,w>  in  the  case 
here>  and  each  other  word  <<x^,*i£>  m  *  m,  A  0  A  for  the  case 
here> . 


Using  the  product  form  of  Qj,  C^j  and  P,  Eqs.  <2.1!.  2.6, 

2.7 >,  and  the  definition  of  rates  in  <  2 . 2  ) ,  this  simplifies  to 

P#3  <  expCpN<R1+R  >3  Q  <x>Q  <w>P<y | xw)  1/a  +  f  >l  1  +  pl  N 

•-y  ‘•x,w  J  ■*  < 2 . 9 > 

Next  consider  Pel,  the  probability  that  m  ^  m  and  t  *»  ft.  We 
first  condition  this  probability  on  a  particular  message  t 
entering  the  second  encoder,  and  a  choice  of  code  with  a 
particular  Vp  transmitted  at  the  second  input.  Given  v^,  we  can 
view  the  channel  as  a  single  input  channel  with  input  ><„,  and  with 
transition  probabi  1  it ies  P<y|xm*ift>. 

A  maximum  likelihood  decoder  for  that  single  input  channel 
will  make  an  error  <or  be  ambiguous>  if 

P<|y|x^/wA>  2  PCylx  for  at  least  one  m'  *  m.  <2. 1©> 

Since  this  event  must  occur  whenever  a  type  1  error  occurs,  the 
probability  of  a  type  1  error,  conditional  on  is  upper bounded 
by  the  probability  of  error  or  ambiguity  on  the  above  single 
input  channel.  Using  theorem  5.6.1  of  C103  again  for  this  single 
input  channel,  we  have,  for  any  p»  0  S  p  <  1, 


<2. 11) 


PCType  1  error <  <M— 1>P 


5  Q1<x>P<«#j 


1  / 1. 1  +  p  ) 

XU  > 


1+P 


Taking  the  expected  value  of  <2.11)  over  and  then  using  the 
product  form  of  ancl  P  again. 


1/<1+p)' 

1  +  P 

P  «  <  expCpNR.l 
el  1 

^  Q2<w) 

^  Q^<x)P<y|xw) 

y,  w 

X 

Applying  the  same  argument  to  type  2  errors,  for  all  p,  0  <  p  < 

1, 


e2 


1<1  +  p)' 

1  +  P 

<  expCpNR^D 

]>  Q.<x> 

^  Q2<w)P<yJxw) 

y.x 

w 

N 


<2. 13> 


Putting  <2. 9),  <2.12),  <2.13)  in  a  form  to  emphasize  the 
exponential  dependence  on  N,  we  have: 

Theorem  2.2  <Slepian-Uol f ) *  Consider  an  ensemble  of  <N,M,L) 

codes  in  which  <Xj . and  are  independent  ly 

chosen  according  to  <2. 6)  and  <2. 7)  for  a  given  probability 
assignment  Q<xw)  *  Qj<x)G>2^w^*  Then  the  expected  error 
probability  over  the  ensemble  satisfies 
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sxp 


for  all  p*  0  £  i 
all  i  =  1*2*3. 


:.  is> 


Eo3<p,Q> 


1/<1+P>  1+p 

^  Q j <x)Q2<w>P<y | xw> 

X,  w 


<2. 19) 


Th*  behavior  of  th*  expressions  Eoi<p,Q>,  i  *  1*2*3*  is  the 
same  as  for  th*  single  input  cast.  In  particular  let  I j  *  i  ■ 
1,2,3,  be  given  by 


Ij  -  I  <X|  Y | W> ,  I2  -  I<W|Y|X),  I3  -  I<XW»Y> 


<2.20) 


as  defined  in  <2.3>-<2.5>.  Then  if  1^  >  0*  th*  function  E0^<p*Q> 
is  convex  n*  strictly  increasing  in  p*  and  positive  for  p  >  0. 
Furthermore*  the  maximum  of  E0j<p*Q)-pR4  over  0  S  p  S  1  is 
positive  and  decreasing  in  for  8  S  Rj  <  Ii  (see  theorem 


•  * 

-2.9- 

and  5.6.4  of  C103  for  proofs>.  Theorem  2.2  then  asserts  that  if 
<  Ij,  i  =  1,2,3,  then  P  ^  decreases  exponent i a  1 ly  with 
i ncr eas i ng  N . 


There  are  two  questions  we  want  to  explore  in  the  rest  of 

e*  •  * 

this  section.  First,  how  tight  is  this  bound  on  error 
probability,  and  second,  what  indication  does  it  give  of  the 
practicality  of  coding  for  multiple  access  channels.  To  explore 
the  question  of  tightness,  we  first  interpret  the  terms  Pdi  in 
<  2.  14  >  . 

P^l*  as  upper  bounded  in  <2. 12>,  is  the  error  probability 
that  would  result  if  a  "genie"  informed  the  decoder  about  the 
second  source  message  0. .  This  gen i e  a i ded  error  probab i  1  i t y  is 
also  clearly  a  lower  bound  to  Pe,  so  that  when  type  1  errors  are 
the  predominant  cause  of  errors,  the  genie  aided  error 
probability  closely  approximates  P#.  Similarly,  the  bound  for 
P#3  is  the  conventional  single  input  random  coding  bound  for  a 
single  code  of  rate  Rj+F^  using  combined  inputs  with  probability 
Ql<x)G>2<u>.  Our  conclusion,  then,  is  that  the  bound  on  Pe  in 
theorem  5.2  is  quite  tight  for  the  given  ensemble  of  codes.  The 
problem,  as  we  shall  soon  see  through  a  set  of  examples,  is  that 
the  best  codes  are  not  always  representat ive  of  the  ensembles. 
Example  1*  The  Collision  Channel. 

Let  X  =  <0,1,... K>  and  W  *  <0,1,...,  K>.  Ule  regard  0  as  an 
“idle”  input,  and  if  0  is  the  x  input  for  a  given  w  input,  then  y 
is  the  pair  <0,w).  Similarly  if  w*0,  the  output  is  <x,0>. 

Finally  if  x  *  0  and  w  *  0,  the  output  y  is  a  special  symbol  e 


representing  "collision” 


This  is  shown  in  fig.  2.3  for  K»; 


First  consider  the  achievable  rate  region.  For  any  given 
Q  2  v.  ■' ,  Q2 <  v  >  .  it  is  easy  to  see  that,  conditional  on  the  output  y, 
the  two  inputs  are  statistically  independent;  thus  I < X ; V j  W  >  = 
I<X;Y>  and  the  set  of  rates  satisfying  <2.3>-<2.5>  forms  a 
rectangle.  We  next  want  to  find  the  set  of  rates  so  that  <2.3>- 
<2.5>  is  satisfied  for  some  choice  of  Q^,Q2«  It  should  be  clear 
from  symmetry  that  Qj<x>  should  be  constant  for  all  x  >  0  and 
02(w>  should  be  constant  for  w  >  0;  thus  we  need  only  consider 
the  union  of  rates  satisfying  <2.3>-<2.5>  over  all  choices  of 
Qj<0>  and  Fig.  2.4  shows  the  resulting  union;  for  all  K  > 

8,  the  set  of  rates  is  non-convex  (the  potential  non-convexity 
for  multi -access  channels  was  first  shown  by  Clll).  The  convex 
hull  of  this  union  region  is  the  set  of  achievable  rates  of 
theorem  2.1.  Theorem  2.2  assures  us  that  exponent i a 1 1 y  decaying 
error  rates  are  achievable  in  the  interior  of  the  union  region. 
Any  given  rate  pair  in  the  interior  of  the  convex  hull  is  on  a 
straight  line  between  two  pairs  of  rates  each  in  the  interior  of 
the  union  region.  By  time  division  multiplexing  between  codes  for 
these  rate  pairs,  reliable  communicat ion  is  achieved  for  the 
given  rate  pair.  Thus  theorem  2.2  establishes  the  positive  half 
of  theorem  2.1. 

It  is  rather  surprising  at  first  that  the  union  region  is 
non-convex.  We  note  that  ICXW5Y>  is  a  convex  n  function  of  Qj<x> 
and  a  convex  n  function  of  Q2<x>,  but  is  non-convex  as  a  joint 
function  of  Qj  and  Q 2 •  It  is  also  convex  as  a  function  of 
Q<x,w)»  but  the  set  of  probability  vectors  GKx,w>  for  which 
GKx,w>  *  Q,<x>Qo<w>  for  some  Q,,Qo  is  a  non  convex  region.  Thus 
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£  .  <p,x,q''°,q<2>> 

01 


XE  .  C  p,  Q<  1  y  >  +  <  1  -X > E  .  <  p,  Q<2'’  > 
ox  ox 


< 2 . 25  > 


Ho  examples  have  been  found  where  this  approach  enlarges  the 
r eg i ons  defined  above;  this  approach  is  sufficient,  however, 
to  achieve  exponential  decays  in  F*e  for  all  rate  pairs  in  the 
interior  of  R. 

Another  approach  is  to  consider  random  coding  ensembles  in 
which  successive  letters  are  statistically  dependent.  For  the 
collision  channel,  for  example,  suppose  the  block  is  divided  into 
sub-blocks  of  four  letters  each.  Within  each  sub-block, 

<Xj , X2» Xg, x4>  has  either  the  form  <x,xj0,©>  or  the  form 
<0,0, x,x),  each  with  equal  probability.  Similarly,  <wj , Wg, Wg, w^> 
has  either  the  form  <w,o,w,0>  or  <0,w,0,w)  with  equal 
probability.  Finally,  x  and  w  are  independent ly  and  equiprobably 
chosen  from  Cl, 2, . ..,K).  With  this  arrangement,  each  sub-block 
is  equivalent  to  a  noiseless  x  channel  with  2K  inputs  and  a 
noiseless  w  channel  with  2K  inputs  <this  example  was  suggested  by 
Massey's  coding  scheme  for  unsynchronized  collision  channels 
C143).  The  resulting  random  coding  exponent  is  clearly  larger 
than  that  where  the  successive  letters  are  independent  with  the 
same  marginal  probabi 1 it ies. 

The  purpose  of  the  above  discussion  was  not  to  find  the 
largest  exponents  achievable  for  the  collision  channel,  but 
rather  to  illustrate  why  error  exponents  are  far  more  complicated 
for  multiaccess  channels  than  for  single  input  channels.  It  also 
illustrates  why  there  is  no  simple  sphere  packing  lower  bound  to 
for  multiaccess  channels  that  yields  the  same  error  exponents 


as  the  random  coding  bound.  ftrutyunyan  C  153  has  developed  a  type 


of  sphere  packing  bound  for  multiaccess  channels?  but  it  is 
somewhat  loose  since  it  does  not  account  for  the  separation  of 
the  two  encoders  for  the  type  3  errors. 

Example  2:  Additive  White  Gaussian  Noise  Channel  <AWGN> 

We  now  turn  to  another  example  of  somewhat  greater  practical 
importance  where  the  random  coding  exponents  work  out  more 
nicely.  Suppose  the  X»  W,  and  Y  alphabets  are  each  the  set  of 
real  numbers?  and  the  output  y  is  given  by 

y  *  x  +  w  +  z  <2.26  > 

where  z  is  a  zero  mean  Gaussian  random  variable  of  variance  <r2 
independent  of  x  and  w.  The  x  input  and  w  input  are  each 
constrained  to  have  mean  square  values  at  most  and  S2 
respectively.  1 f  we  consider  the  channel  as  a  cascade  of  a 
noiseless  channel  adding  x  and  w  and  then  a  single  input  Gaussian 
channel?  we  see  that  I<XWjY>  is  at  most  the  capacity  of  the 
single  input  channel  with  the  input  constrained  to  energy  Sj+S2. 
Thus 


I<XW|Y> 


log  1  + 


V  S21 
a2  J 


<2. 27 > 


It  is  also  easy  to  see  that  I<X|Y|W>  is  the  average  mutual 
information  between  x  and  y  in  the  absence  of  w.  Thus 


T.7.v  -T.'.V.'T.; 


I<X; Y|W> 


log 


[* 


<2. 28) 


ICWiY|X> 


log  1 


<  2 . 29  > 


These  inequalities  are  satisfied  for  all  independent 
distr ibut ions  on  x  and  w  and  are  all  satisfied  with  equality  if  x 
and  w  are  independent  zero  mean  Gaussian  with  variances  Sj  and  S2 
respectively.  Thus  the  rate  region  for  which  <2.3>-<2.5>  are 
satisfied  for  some  independent  x  and  w  distribution  is 


Rl+  R2 


S  |  log 


1  + 


VS2 


<2. 30> 


0  S  Rs 


S  |  109  1  -  2| 

<7 


■i*  *  a 


<2. 31 > 


0  S  R_  S  ~  log] 

2  L  a 


■K-§] 


<2.32) 


Since  this  region  is  convex  already*  it  is  the  achievable  rate 
region  R. 

This  region  R  is  sketched  in  fig.  2.5  for  various  values  of 
signal  to  noise  ratios  A  ■  $/o^,  s  *  Sj+S2»  for  the  case  where 
Sj  ■  S2.  Note  that  the  region  is  almost  rectangular  for  small  A 
and  almost  triangular  for  large  A.  Note  that  if  one  uses  TDM 
between  a  code  for  x  and  a  code  for  w»  then  the  achievable  rates 


are  limited  to  the  region  bounded  by  the  straight  line  between 
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the  axis  intercepts  of  the  boundary  of  R  <see  fig.  2.5).  Thus 
for  large  A*  TDM  is  almost  as  good  as  the  best  coding*  whereas 
for  small  A,  TDM  is  quite  inferior.  The  reason  for  this  can  be 
seen  most  clearly  for  the  case  Rj  =  R'2 •  Alternating  between 
<Rj*0>  and  <05 R2>  then  wastes  half  the  awailable  power*  since  <by 
our  model  >5  the  first  transmitter  stays  within  its  power- 
limitation  while  transmitt ing.  Losing  half  the  available  power 
loses  only  a  small  fraction  of  the  available  capacity  for  large  A 
whereas*  for  small  A*  a  large  fraction  is  lost.  This  suggests 
using  frequency  division  multiplexing*  achieving  the  same 
simplicity  as  TDM*  but  being  able  to  use  all  the  available  power 
<see  fig.  2.6). 

Next  consider  the  random  coding  exponent  for  these  channels. 
Using  the  above  Gaussian  distribution  for  x  and  w*  we  can  easily 
calculate  Eoi<P»Q>  from  <2. 17)-<2. 19> *  replacing  sums  with 
integrals.  The  result  is 


E  .<P,G»  * 

Ol 


-  m[.  *  -A-] 

2  L  <T  <1  +  P>J 


<2. 33 > 


where  S3  ■  Sj  +  S2*  Letting  Aj  »  S^cr2*  we  can  maximize 
CEol < p,Q>-pR1 3  over  p  to  get  the  parametric  equations 


vr--.-' 


r  i  a 


2  <  1  +  p  .  Xl  +  p.+ft.  > 
1  11 


0<P.S1 


i  r  a  .  i  p .  Aj 

Ri  =  2  ln[1  +  T+7~j  '  2a  +  P.Xl  +  P.+A.) 


<  2 . 34  > 


For  rates  lower  than  those  where  p^  =  1, 


<2. 35> 


As  in  <2.22>  and  <2.23>,  the  random  coding  exponent  Er<CRlfR2>  is 
the  minimum  of  E^CRj}  over  i  •  1,2,3.  The  region  R  divides  into 
three  subregions  as  shown  in  fig.  2.7  where  E^CR^  for  each  i  is 
dominant.  As  the  rates  decrease*  the  error  probability  of  type  3 
errors  decreases  more  rapidly  than  that  for  type  1  and  2  errors, 
so  that  for  small  rates  the  bound  is  dominated  by  errors  in 
source  1  or  2  but  not  both. 

For  a  single  input  additive  Gaussian  noice  channel,  choosing 
a  coding  ensemble  with  the  Gaussian  distribution  is  not  quite  the 
best  thing  to  do.  The  best  distribution  results  from  a  shell 
constraint!  that  is,  code  words  are  chosen  with  a  Gaussian 
distribution  condi t ional  on  the  resulting  word  having  an  energy 
very  close  to  HSj.  This  distribution  (see  section  7.4,  C103) 
yields  the  same  exponent  to  Fe  as  the  sphere  packing  bound  for 


rates  sufficiently  close  to  capacity 


For  a  multiaccess  channel,  it  seems  reasonable  to  again 


consider  a  random  coding  ensemble  using  a  shell  constraint  on 
each  set  of  code  words.  From  the  genie  interpretat ion  of  type  1 
and  2  errors,  we  see  that  is  upperbounded  by  the  probability 

of  error  for  the  first  set  of  code  words  with  the  additive 
Gaussian  noise  but  without  the  second  set  of  code  words.  Thus, 
for  i=l,2,  we  have  Pei  £  aiN  expC-NEr^ <Rj > 3 .  where  from  section 
7.4  of  C 101 .  ai  is  a  constant  and  Eri<Ri>  is  given  by: 


E 


.  CR.  > 


r  i  l 


1 

2 


InCe.-T. > 

l  l 


<2. 36  > 


for 


5  ln[ 


a/4X2+A.+ 


4+A 


>] 


R. 

i 


lnd+A. > 

l 


<2. 37  > 


where 


A  .  <  @ - 1  >  r  - rr - 

-■] 


Ti  = 


<2. 38  > 


i  i 


pj  *  exp<2  R^ 


<2.39> 


For  Rj  less  than  the  lower  limit  in  <2.37>» 


E  , <R. > 

ri  l 


A .  .  A . 

i  -  e,  +  -Jr*  +  k  lnce.ce, - -  R . 


i  2  2  i  i  2 


<2. 4© > 


For  rates  satisfying  C2.34),  the  sphere  packing  bound  for  the 


single  input  channel  gives  a  lower  bound 

P  .  5  exp  f-HCE  .  <■'  R .  >  +  o  <  H  >  D  l  ;  i  =  1 , 2 

ei  l  ri  l  J 

for  all  codes  where  o<N>  approaches  6  with  increasing  N. 

For  type  3  errors*  the  situation  is  less  simple  since  the 
combined  code  words  x  +  w  are  not  constrained.  In  fact*  if* 
after  constraining  x  to  have  energy  NSj  and  w  to  have  energy  NS2» 
we  then  constrained  jc to  have  energy  N<Sj+S2>*  we  would  then  be 
constraining  the  code  words  of  the  two  codes  to  be  orthogonal, 
which  is  just  a  generalized  version  of  the  frequency  division 
multiplexing  discussed  previously. 

We  now  develop  a  bound  on  P^g  using  a  shell  constraint  on 
the  code  words  x^  and  «#£.  Choose  each  x  independently  using  the 
density  Q|<x>  and  each  w  using  the  density  Q2<*>  where 


Q.  <x> 

x 


m . . <x> 

x  1 


N 

n 

n=  1 


<  2 .  42  > 


i‘  ^  <  X> 
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1 ;  for  Nb .  —  &  <  >  x  i  US . 

l  n  i 

n=l 


<.  2.43 


0;  otherwise? 


where  5-  is  art  arbitrary  positive  number?  and  is  a  normalizing 
constant  to  make  Qj<x>  integrate  to  1.  Substituting  <2.40}  for 
Q«<x>  and  ©2 ^  into  <2.8>?  replacing  sums  with  integrals?  and 
upper  bound i ng 


> .  (  x >  exp  C  r .  *  +  ;■  r .  <.  x  -S .  >  1 

l  l  x  n  i 

n=l 


<2. 44> 


For  any  r^  >  0?  i  *  1?2?  we  find  that  <2.8}  breaks  into  a  product 
form  (as  in  section  7.3  of  Cl©3}.  After  some  tedious 
integration?  we  get?  for  any  p?  05  p  $  1? 


[expCS<r  +r_>  . 

—  ^  ■-  J  "  WC-N<E03<p.r>-pR3>l 


<2. 45 > 


E03<p'r> 


<l  +  p>  In 


*  0102 


V®2  p 
+  |  11 


91  02-l 


< l  +  p>  < l-2r . S, > 


(2. 46 > 


<2. 47} 


The  first  term  in  <2.45>  is  proportional  to  +  for  any 
given  choice  of  r  ^ , r  2. ,  and  S,  so  we  simply  bound  it  by  aN^  for 
some  suitable  a.  The  exponent  can  be  optimized  over  p,  r^,  r-, 
(or  equivalently  over  p,  Sj,  8^  for  0  <  p  <  1,  8  S  S  l  +  p>. 

For  the  important  case  where  Aj  =  A2,  f,he  optimization  can  be 
carried  out  explicitly.  Here  by  symmetry,  the  optimal  9j  and  @2 
are  equal,  and  such  a  solution  is  also  valid,  but  not  optimal, 
for  all  Aj  and  A2.  Using  6  for  Sj  and  92  and  A  for  Aj  +  A2, 


0J 


<  l  +  p>  ln<~— >  -  9  +  £-  1  n <  1  + 

l  +  P  £.  8 


< 2 . 48  > 


Optimizing  the  exponent ,  we  find  that  for 


£  R0  £  \  ln< 1+A> ,  <2.49, 


WV  - 


<2.50> 


l+P- A  .  1 


<1+P>2+A2+2A 


a  at 


<2.5l> 


1  -  |  +  h  ln<A2+2A+4> 

2  2 


<2. 55) 


This  exponent  lies  roughly  half  way  between  the  previously 
der i ved  exponent  wit  hout  a  she 1 1  const ra i nt  and  t  he  exponent  w i t h 
a  shell  constraint  that  would  result  for  a  single  input  Gaussian 
channel  with  signal  to  noise  ratio  A  <i.e.  that  given  by  <2.36>- 
<2. 41 > . 

When  we  take  the  minimum  of  the  three  exponents  Eri<RA>  for 
i  *  1,2,3,  we  again  find  that  the  achievable  region  R  breaks  into 
3  subregions,  one  where  each  bound  is  dominant;  the  regions  look 
the  same  as  in  fig  2.7,  although  numerically  they  are  somewhat 
different.  We  now  know,  however,  that  whenever  the  rate  pair 
<Rj,R2>  is  in  R^Cor  R2>  and  Rj  (or  Rj)  is  above  the  critical  rate 
of  <2.36),  then  Er<Rj,R2>  is  indeed  the  exponent  for  optimal 
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codes.  For-  the  symmetric  case  where  R,  =  R.->.  the  region  R-. 
•■•anishes  for  small  ©rough  Rj  =  R-, ,  arid  if  the  point  where  Rp 
vanishes  is  above  the  critical  rat©  for  R.  and  R.-.,  then  the 
optimum  exponent-  is  given  by  <2. 37>-<2. 39 >  between  the  point 
where  R3  vanishes  and  the  critical  rate.  This  phenornen  occurs 
whenever  the  combined  signal  to  noise  ratio  A3  is  below  about  3. 


While  the  theoretical  development  of  coding  theorems 
for  multiaccess  channels  is  quite  advanced,  very  little  has  been 
done  with  respect  to  general  techniques  for  multiaccess  coding. 
As  pointed  out  in  the  introduct ion,  what  is  needed  is  a  coding 
technonology  applicable  where  there  are  a  large  set  of 
transmitters  but  only  a  small  subset  simultaneously  use  the 
channel  while  the  others  are  idle.  Here?  however  *  we  restrict 
ourselves  to  the  simpler  problem  of  the  two  input,  channel  of  fig 

2. 1  where  both  sources  always  have  something  to  send. 

First  we  observe  that  the  error  probability  bounds 
evaluated  in  the  last  section  apply  equally  well  to  ensembles  of 
linear  codes.  The  argument  for  this  is  the  same  as  in  section 

6.2  C103.  In  generals  binary  linear  codes  can  be  generated  for 
each  transmitter,  and  sub -blocks  of  these  binary  digits  can  be 
mapped  many  to  one  into  the  channel  input  alphabet,  thus 
achieving  any  desired  relative  frequency  of  utilization  of  the 
various  input  letters. 

Random  coding  bounds  for  convolut ional  codes  have  also 
been  generalized  from  single  input  channels  to  multiaccess 
channels  C163  with  the  same  type  of  large  exponent  as  occurs  for 
the  single  input  channel.  Thus  there  is  no  problem  generating 
good  codes,  either  block  or  convolutional.  The  problem,  as  with 
single  input  channels,  is  with  decoding. 

Before  discussing  decoding,  a  brief  discussion  of 
channel  modelling  is  in  order.  The  discrete  time  channels  dear 
to  the  hearts  of  information  theorists  implicitly  assume  that 
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carrier  phase  and  sampling  time  in  physical  channels  are  part  of 
the  channel  model.  Furthermore,  ideal  performance  of  these 
elements  is  usually  assumed.  For  single  input  channels  this 
separation  is  usually  perfectly  reasonable*  but  for  multiaccess 
channels  it  is  often  questionable.  For  example*  for  the  AWGN 
multiaccess  channel,  it  is  well  known  C173,C183  that  feedback  can 
increase  the  achieuable  ^1+^2  beyond  that  achievable  by  a  single 
source  of  rate  Rj+R-2  and  energy  constraint  S1+S2-  In  other 
words,  the  individual  transmitters  are  limited  to  Sj  and  S2 
respectively,  but  the  signal  energy  at  the  receiver  exceeds 
®1+S>2‘  This  means  that  the  two  transmitting  antennas  are  acting 
as  a  phased  array  and  that  the  additional  receiver  energy  comes 
from  antenna  gain  < along  with  very  clever  feedback  coordination). 
While  this  is  not  impossible,  it  is  certainly  not  a  conventional 
situation. 

Typically  we  should  expect  the  received  carrier  phase 
from  the  one  transmitter  to  be  roughly  independent  of  that  from 
the  other.  Approximate  baud  synchronism  between  the  transmitters 
is  slightly  more  reasonable  than  phase  synchronism  and 
approximate  block  synchronism  is  eminently  reasonable  with  only 
mar g i na 1  f eedbac k  commun i cat i on . 

There  appears  to  be  little  of  a  general  nature  that  can 
be  said  about  the  effect  of  asynchronism  between  the  sources  at 
the  phase  and  baud  level.  For  the  specific  case  of  an  AWGN 
channel,  however,  the  situation  is  much  simpler.  Using  a 
Gaussian  ensemble  <with  or  without  a  shell  constraint>  to 
generate  code  words,  the  discrete  time  code  words  of  the  last 


section  can  be  considered  as  time  samples  over  the  block  period 
of  a  narrow  band  stationary  Gaussian  process  with  alternate 
letters  representing  in  phase  and  out  of  phase  components.  Thus 
for  a  given  set  of  randomly  chosen  waveform  code  words .  a  change 
of  receiver  carrier  phase  and  sample  time  will  change  the 
discrete  time  code  but  will  not  change  the  ensemble  statistics 
< aside  from  some  end  effects  at  the  ends  of  the  block  which  we 
ignore).  The  decoder  must  know  the  relative  carrier  phase  and 
sample  time  for  each  of  the  two  transmitters  but  there  is  no  need 
for  the  two  to  be  synchronized  together.  In  summary .  the 
discrete  time  AUGN  multiaccess  model  of  the  last  section  is 
adequate  for  non- feedback  communication  maintaining  only  block 
synchronizat  ion.,  but  is  only  adequate  for  feedback  techniques  in 
the  rare  case  where  the  two  transmitters  are  phase  and  baud 
synchron i zed . 

The  problem  of  lack  of  block  synchronizat ion  for 
multiaccess  channels  is  somewhat  better  understood  than  that  of 
phase  and  baud  synchronizat ion.  Assuming  a  discrete  time  model 
Ci.e.  assuming  away  the  phase  and  baud  synchronizat ion  problem) . 
it  has  been  shown  C19D  that  with  a  bounded  amount  of  uncertainty 
in  timing  between  the  transmitters.  the  feasible  region  R  is  the 
same  as  with  perfect  synchronizat ion.  Essentially  one  uses  a 
coding  constraint  so  large  that  the  timing  uncertainty  becomes 
negligible.  For  complete  uncertainty  in  timing.,  on  the  other 
hand,  it  has  been  shown  C 20 1  that  the  feasible  region  is  the 
union  region  of  fig.  2.4  rather  than  its  convex  hull.  The 
essential  idea  here  is  that  time  sharing  cannot  be  used  in  the 


total  absence  of  relative  timing  between  the  transmitters. 

Having  cautioned  the  reader  about  the  modeling  problems 
inherent,  in  a  discrete  time  memory  less  model  of  multiaccess 
channels*  we  now  return  to  this  model  to  see  what  can  be  said 
about  coding. 

First,  there  is  a  fairly  simple  general  approach  that 
can  reduce  the  decoding  problem  to  several  single  source  decoding 
problems.  First  suppose  that  <Rj,R2>  satisfies  Rj  <  I<X?Y)W>,  R2 
<  IOrf;  Y>  for  some  assignment  Q^<x>,  G^w}.  Over  the  ensemble  of 
codes  using  G!j,GJ2*  a  decoder  can  decode  the  w  code  word  by 
ignoring  the  x  code  word  and  assuming  a  single  input  channel  with 
transition  probabi 1  it ies  P<y|w>  *  Qj <x>P<y | xw> .  Over  the 

ensemble  of  codes  for  the  first  encoder,  this  is  precisely  the 
set  of  transition  probabilities  from  w  to  y.  Thus  a  "good" 
decoder  for  a  single  input  channel  can  decode  v  reliably.  Given 
w,  another  decoder  for  a  single  input  channel  can  decode  x  using 
P<yn|xnwn>.  This  second  decoding  is  somewhat  unconvent i na 1  for 
single  inputs  in  that  the  transition  probabi 1  it ies  depend  on  wn 
and  thus  vary  with  n,  but  a  number  of  decoding  techniques  such  as 
sequential  decoding  and  Viterbi  decoding  can  deal  with  this 
situat ion. 

As  can  be  seen  from  fig.  3.1,  any  <Rj,R2>  in  the 
interior  of  the  achievable  region  of  <2.3>-<2.5>  for  a  given 
Q^,Q2  can  be  represented  as  a  convex  combination  of  two  rate 
pairs,  one  of  which,  <RJ,R£>,  satisfies 
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and  the  other  of  which  satisfies 


Rj  <  I(X; Y)}  R‘2  <  I(W;Y|X)  (3.2) 

Codes  for  each  of  these  rate  pairs  can  be  decoded  by 
the  two  step  procedure  described  above  and  <R^»R2>  can  be  decoded 
by  ti me  shar i ng  bet  ween  t  wo  such  codes . 

Finally*  any  point  in  the  interior  of  the  achievable 
rate  region  is  a  convex  combination  of  two  rate  pairs,  one  of 
which  satisfies  (2.3)-(2.5)  with  strict  inequality  for  some  0^2 
and  the  other  for  some  other  Thus  an  arbitrary  point  in 

the  interior  of  R  can  be  reliably  decoded  by  time  sharing  between 
at  most  4  codes*  two  of  which  use  rates  satisfying  (3.1)*  (3.2) 
respectively  for  Q^Q2  and  the  other  two  of  which  satisfy  (3.1), 
(3.2)  for  Q*02- 

This  approach  is  not  entirely  satisfactory  for  two 
reasons.  The  first  is  that  the  random  coding  exponents  for  error 
probability  in  this  approach  are  often  much  smaller  than  those 
for  joint  decoding  of  the  two  code  words  together.  If  we  use 
error  exponents  as  a  crude  measure  of  decoding  complexity*  this 
indicates  that  the  price  of  avoiding  joint  decoding  is  much 
greater  complexity  for  the  single  input  decoders.  Note*  however* 


that  error  exponents  can  sometimes  be  misleading  as  a  guide  to 
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decoding  complexity.  For  examp le,  the  random  coding  exponent  for 
a  noiseless  binary  channel  is  not  large,  whereas  coding  and 
decod i ng  are  comp 1 et  e 1 y  trivial. 

The  other  objection  to  this  approach  is  that  it  fails 
to  provide  much  insight  into  the  question  of  joint  decoding  of 
several  sources.  It  certainly  does  not  generalize  to  the  use  of 
a  small  but  unknown  subset  of  a  large  set  of  transmitters. 

A  second,  simpler  but  less  general,  approach  is  to 
decode  the  code  words  from  each  transmitter  i ndependent 1 y 
regarding  the  other  as  noise.  From  fig.  2.5,  it  is  seen  that  for 
the  AUIGN  channel  with  small  signal  to  noise  ratio,  the  achievable 
rate  region  is  almost  rectangular.  Analytically  I < X ; V >  = 

<  l/2>  InC  1  +  A1/<1+a2>]  which  is  close  to  XCX;Y|U>  *  <  l/2>  InC  1+Aj  D 
when  A2  is  small.  In  this  case,  the  error  exponent  for 
individual  decoding  is  almost  the  same  as  for  joint  decoding. 

This  second  approach  can  be  carried  one  step  further  by 
choosing  all  the  code  words  for  transmitter  1  to  be  orthogonal  or 
almost  orthogonal  to  all  those  for  transmitter  2.  This  is  the 
approach  taken  in  frequency  division  multiplexing,  and  has  the 
added  advantage  of  largely  eliminating  the  problems  caused  by 
relative  differences  in  carrier  phase  and  baud  timing  between  the 
t r ansm i t t er s . 

This  approach  is  also  used  in  spread  spectrum 
communicat ion.  This  has  the  added  advantage  of  allowing  a  large 
number  of  transmitters,  all  of  whose  code  sets  are  approximately 
orthogonal  to  all  the  other  code  sets.  When  only  a  subset  of  the 
transmitters  transmit  at  one  time,  the  interference  from  the 
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other  transmitters  is  reduced  and  the  individual  code  words  can 
be  successfully  decoded.  This  same  approach  has  been  used  by 
Cohen  et  al  E  21  3  and  Sommer  C223  in  the  context  of  multiaccess 
pu 1 se  pos i t i on  modu 1 at i on . 

For  an  arbitrary  discrete  time  memoryless  multiaccess 
channel,  perhaps  with  more  than  two  transmitters,  one  can 
similarly  investigate  ways  to  choose  code  word  sets  for  the 
individual  transmitters  in  such  a  way  that  they  are  mutually  non 
interfering  (more  precisely,  so  that  they  can  be  individually 
decoded  with  small  error  pr obabi 1 ity> .  Time  sharing  within  a 
code  word  i s  one  poss i b i 1 i t  y ,  but  depend i ng  on  t  he  channe 1 ,  ot  her 
possibilities  might  be  preferable,  as  we  have  seen  for  the  AWGN 
channel.  A  more  difficult  related  problem  is  to  choose  the  code 
word  sets  in  such  a  way  as  to  maintain  the  non-interference 
property  in  the  presen ce  of  lack  of  baud  synchronism  between  the 
transmitters.  Ule  have  seen  that  this  can  be  done  for  the  AUIGM 
channel,  and  Massey's  coding  scheme  C143  for  the  asynchronous 
collision  channel  also  achieves  this  objective;  at  present , 
however,  no  approaches  are  known  for  general  discrete  time 
memory less  channels. 

As  a  third  approach  to  decoding,  consider  true  joint 
decoding  of  the  two  code  words.  I  will  not  consider  algebraic 
decoding  techniques  here  since  an  algebraic  structure  must  be 
matched  in  some  sense  to  the  channel  character ist- ics  and  I  am  not 
aware  of  any  interesting  examples  of  general  algebraic  approaches 
for  multiaccess  channels.  Viterbi  decoding  of  convolut ional 
codes  is  another  possibility,  but  it  does  not  appear  very 


promising  is  a  joint  decoding  technique.  The  problem  is  that  the 
decoder  should  track  all  possible  states  of  both  encoders,  which 
leads  to  a  combined  number  of  states  which  is  the  product  of  the 
individual  numbers  of  states.  With  more  than  two  transmitters, 
the  problem  is  even  worse. 

Final ly,  sequential  decoding  appears  to  be  a  general 
approach  to  multiaccess  joint  decoding  and  it  has  been  shown  C233 
that  lack  of  block  synchronizat ion  is  not  a  serious  impediment  to 
its  operation.  Unfortunately,  at  this  time,  it  is  not  clear  how 
to  make  sequential  decoding  work  for  a  multiaccess  channel.  To 
explain  the  difficulty,  recall  that  sequential  decoding  is  a 
search  procedure  that  hypothesizes  the  encoded  sequence  up  to  a 
given  point  and  either  proceeds  forward  by  extending  the  encoded 
sequence  or  searches  backward  depending  on  the  value  of  a 
•’metric"  that  stochastically  drifts  upward  when  the  decoder  is 
following  the  actual  uncoded  sequence  and  drifts  downward  when 
the  decoder  gets  off  the  track. 

The  problem,  now,  is  that  the  decoder  can  go  off  the 
track  in  three  ways,  corresponding  to  the  three  types  of  errors 
in  section  2.  Unfortunately  the  appropriate  metric  to  use 
depends  on  the  type  of  error  being  made,  and  this  knowledge  is 
unknown  to  the  decoder.  It  appears  that  no  single  metric  is 
adequate  for  sequential  decoding  to  work  on  a  general  discrete 
memoryless  multiaccess  channel  with  bounded  expected  computation 
up  to  the  normal  computational  cut  off  rate.  It  might  be 
possible  to  develop  a  sequential  decoding  algorithm  that  utilizes 
several  metrics  simultaneously,  but  so  far  no  such  algorithm  has 


been  dev  i  sed 


Another  fundamental  problem  with  sequential  decoding 
has  recently  been  discovered  by  Arikan  C24 1 ,  Arikan  considers  a 
multiaccess  binary  erasure  channel  where  X  =  C0,1>.  U  =  £0,1}  and 
V  =  <  < 0 , 0 >  ,  < 0 ,  1  > ,  t;  l ,  0 } ,  <  l ,  1  >  ,  <e,e>}.  W i t h  probab i  1  i t y  1  -  s , 
for  some  s  >  0,  y  =  <x,w>,  whereas  with  probability  z, 
independent  of  the  input,  y  =  <e,e>.  In  effect  we  have  two 
erasure  channels  with  perfectly  correlated  erasures.  Using 
equi probable  inputs  for  each  transmitter,  we  can  formally 
calculate  the  computational  cutoff  region  RCl-imp.  for  a  .joint 
decoder  as 
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5>  is  the  active 

constraint,  and  even 

without  any  of  the  metric  problems * discussed  above,  <3.5>  limits 
the  achievable  rate  with  joint  sequential  decoding.  However- 
using  separate  sequential  decoders  for  the  two  transmitters  and 
ignoring  the  erasure  correlation,  we  can  achieve  the  higher  rates 
of  <3. 3)  and  <3.4>. 


To  make  the  situation  worsen  we  see  that  -  lnC < l+3s>/43  is 
also  the  computational  cut  off  rate  of  a  single  input  quaternary 
erasure  channel.  However,  by  regarding  the  inputs  to  the 
quaternary  channel  as  two  binary  digits  and  using  separate 
convolutional  encoders  and  decoders  for  the  two  digits?  we  can 
again  achieve  the  higher  rates.  The  difficulty  here  does  not 
reside  in  the  particular  search  algorithm  being  used.  Over  the 
ensemble  of  convolutional  codes  for  the  quaternary  input  (or 
pa i r s  of  codes  for  b i nary  i nput- s >  ,  the  expect ed  number  o f 
pot  ent i a 1  encoded  sequences  <  or  pa i r s  o  f  sequences )  at  1 engt  h  N 
wh i ch  are  as  likely  as  the  t r ansm i 1 1 ed  sequence  (or  pair}  is 
exponent ial ly  increasing  in  N  for  any  combined  rate  in  excess  of 
-lnC(l+3«)/4],  The  conclusion  that  one  must  reach  is  that  Rcomp 
is  not  really  a  fundamental  parameter  of  communication.  This 
same  example*  in  the  context  of  the  photon  channel,  has  been 
discussed  by  Massey  C253  and  Humblet  C26J. 

Summarizing  the  previous  approaches  to  decoding,  we  see 
that  much  more  research  is  necessary  before  any  cohesive  body  of 
knowledge  about  coding  and  decoding  for  multiaccess  channels  will 


4}  COLLISION  RESOLUTION 

As  briefly  discussed  in  the  introduction?  the  collision 
resolution  approach  to  multiaccess  communicat ion  focuses  on 
allocating  the  multiaccess  channel  among  a  large  set  of  users  at 
different  transmitting  sites.  It  has  the  weakness?  however?  of 
essentially  ignoring  the  communication  aspects  of  the  problems. 

We  start  by  a  set  of  assumptions  that  limit  the  class  of  systems 
we  will  be  considering. 

a>  Slotted  Systems  We  assume  that  each  message  < packet)  to  be 
transmitted  requires  one  time  unit  <a  slot}  for  transmission. 

All  transmitters  are  synchronized  so  that  all  transmissions  start 
at  an  integer  time  and  end  before  the  next  integer  time.  Such 
synchronization  is  usually  not  too  difficult  given  stable  clocks 
and  given  a  small  amount  of  timing  feedback  from  the  receiver. 

In  case  of  propagation  delays?  the  timing  is  relative  to  the 
receiver?  so  that  each  packet  starts  to  arrive  at  the  receiver  at 
an  integer  time.  Naturally  some  guard  space  is  required  in 
practice?  but  we  neglect  that  here.  Note  that  this  assumption 
precludes  both  the  possibility  of  sending  short  packets  to  make 
reservations  for  long  packets  and  of  carrier  sensing?  which  we 
discuss  later.  Such  systems  can  be  understood  much  more  simply 
after  this  basic  model  is  understood. 

b}  Collision  or  Perfect  Receipt  ions  We  assume  that  if  more  than 
one  transmitter  sends  a  packet  in  a  slot  <the  time  from  one 
integer  to  the  next}?  then  there  is  a  collision  and  the  receiver 
gets  no  information  about  the  contents  or  origins  of  the 


transmitted  packets 


If  just  one  transmitter  sends  a  packet  in  a 


slot,  it  is  received  with  no  errors.  This  is  the  assumption  that 
remooes  the  noise  and  communication  aspects  from  the  problem;  it 
allows  collision  resolution  to  be  studied  in  the  simplest  context 
but  also  severely  limits  the  class  of  strategies  and  tradeoffs 
that  can  be  considered. 

c>  Infinite  Set  of  Transmitt erst  Assume  that  each  arriving 
packet  arrives  at  a  transmitter  that  has  never  previously 
received  a  packet.  This  precludes  queueing  at  individual 
transmitters  and  precludes  the  use  of  TDM.  This  is  an 
unreasonable  assumption  from  a  practical  point  of  view,  but  note 
that  given  any  algorithm  determining  when  the  transmitters  send 
packets,  a  finite  set  of  transmitters  can  use  the  same  algorith,m 
regarding  each  packet  arrival  as  corresponding  to  a  separate 
conceptual  transmitter.  In  this  case,  a  real  transmitter  will 
sometimes  send  multiple  packets  at  the  same  time,  causing  a 
collision.  This  shows,  first  that  assumption  c>  provides  a  worst 
case  bound  on  a  finite  set  of  transmitters  and  second,  that  the 
difference  is  only  significant  when  two  or  more  packets  are 
waiting  at  the  same  transmitter.  Collision  resolution  algorithms 
are  primarily  useful  to  reduce  delay  over  what  would  be  achieved 
with  TDM,  so  in  this  low  delay  region,  having  multiple  packets  at 
a  transmitter  should  be  relatively  rare  and  the  performance  with 
a  finite  set  of  transmitters  shuld  be  well  approximated  by  the 
performance  with  an  infinite  set.  The  major  advantage  of  the 
infinite  set  assumption  is  that  we  can  use  the  maximum  throughput 
of  an  algorithm  as  a  qualitative  measure  of  the  goodness  of  the 
algorithm  without  allowing  for  the  somewhat  incidental 


improvements  of  throughput  that  could  be  achieved  when 


transmitters  have  multiple  packets  to  send. 

d>  Poisson  Arrivals:  Assume  that  new  packet  arrivals  are 
Poisson  at  an  overall  rate  X.  Given  assumption  3,  no  other 
arrival  process  would  make  much  sense. 

e)  Q 2  l_a _ c  Immediate  Feedbacks  Assume  that  by  the  end  of  each 

slot,  each  transmitter  learns  whether  0  packets,  1  packet,  or 
more  than  one  packet  <c  for  colli si on)  were  transmitted  in  that 
slot.  This  is  the  only  information  that  each  transmitter  gets 
about  the  existence  of  packets  elsewhere.  The  assumption  of 
immediate  feedback  is  often  unrealistic,  but  collision  resolution 
algorithms  can  usually  be  easily  modified  to  deal  with  delayed 
feedback;  the  introduction  of  delay  in  the  feedback,  however, 
seems  to  greatly  complicate  analysis  with  no  apparent  benefit  in 
insight.  The  assumption  of  0,  1 ,  c  feedback  implies  that  the 
receiver  <or  the  transmitters  themselves)  can  distinguish  between 
an  idle  channel  and  a  collision,  which  is  not  always  reasonable. 
It  also  implies  that  idle  transmitters  are  always  listening  for 
this  feedback,  which  is  not  always  desirable.  Some  alternative 
forms  of  feedback  will  be  discussed  in  what  follows. 

4.1  SLOTTED  ALOHA »  The  simplest  form  of  collision  resolution 
strategy  using  the  assumptions  above  is  Slotted  Aloha,  due  to 
Roberts  C273.  Slotted  Aloha  is  a  variation  of  pure  Aloha, 
devised  by  Abramson  Cl  3,  which  will  be  briefly  discussed 
subsequent ly.  In  slotted  Aloha,  whenever  a  packet  arrives  at  one 
of  the  transmitters,  that  packet  is  transmitted  in  the  next  slot. 
Whenever  a  collision  occurs  in  a  slot,  each  packet  involved  in 


the  collision  is  said  to  be  backlogged  and  remains  backlogged 
until  it  is  successfully  transmitted.  Each  such  backlogged 
packet  is  transmitted  in  each  slot  with  some  fixed  probability 
P  >  1,  independent  of  past  slots  and  of  other  packets.  Note  that 
if  p  were  1 ,  backlogged  packets  would  continue  colliding  and  no 
more  packets  would  ewer  be  successfully  transmitted.  Note  also 
that  because  of  the  effect iwely  infinite  set  of  transmitters*  the 
collision  cannot  be  resol  wed  by  transmitters  waiting  some  number 
of  slots  determined  by  the  identity  of  the  transmitter.  Such 
strategies  can  be  used  with  a  known  set  of  transmitters  and  can 
be  made  to  behawe  like  TDM A  under  heawy  loading. 

It  can  easily  be  seen  that  slotted  Aloha  can  be  analyzed  as 
a  homogeneous  Markow  chain,  using  the  number  of  backlogged 
packets  at  each  integer  time  t  as  the  state.  The  state  at  time  t 
includes  packets  that  collided  in  the  slot  from  t-1  to  t  but  does 
not  include  new  packet  arriwals  from  t-1  to  t.  Let  k  be  the 
state  at  time  t  and  k+i  be  the  state  at  t+1.  Note  that  i  can 
newer  be  less  than  -1  <i.e.  at  most  one  backlogged  packet  can  be 
successfully  transmitted  in  the  slot  Et,t+13.  Furthermore,  i  *  -1 
if  no  new  packets  arriwed  in  Ct-l,t>  and  exactly  one  backlogged 
packet  is  transmitted  in  Et,t+1>.  This  ewent  has  probability 
kp< l-p> *<-1e”^.  The  state  stays  the  same  <i=0>  e i t her  i  f  no  new 
packets  arriwed  in  Et-l,t>  and  no  backlogged  packet  is 
successfully  transmitted  in  Et,t+1>  or  if  one  new  packet  arriwed 
in  Et-l,t>  and  is  successfully  transmitted  in  Et,t+1> 

Analyzing  the  cases  i  >  0  in  the  same  way,  we  see  that  the 
state  transition  probabi 1 it ies  Pk> k+i  are  giwen  by 
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^k, k+i 


In  understanding  how  this  chain  behaves,  we  look  first  at  the 
drift,  Dk,  defined  as  the  expected  value  of  i  conditional  on  k 
<i.e.  the  expected  difference  between  the  state  at  t+1  and  that 
at  t  conditional  on  the  state  at  t>. 

Dk  »  X  -E<l-P>kXe~X  +  kp<l-P>k_1e“X3  <4.2> 

The  first  term  X  is  the  arrival  rate  and  the  second  term  is  the 
departure  rate  or  throughput.  Note  that  for  any  X  >  0  and  any 
P  >0,  Dk  will  be  positive  for  all  sufficiently  large  k.  This 
means  that  if  the  system  becomes  sufficiently  backlogged,  it 
drifts  in  the  direction  of  becoming  more  and  more  backlogged; 
this  should  not  be  surprising  since  collisions  occur  on  almost 
all  slots  when  the  backlog  gets  sufficiently  large.  Kaplan  C283 
gives  a  simple  but  elegant  proof  that  this  type  of  chain  is 
unstable  <i.e.  non-ergodic> . 

Despite  the  instability  of  slotted  Aloha,  it  can  still  be  a 
useful  collision  resolution  approach  especially  if  the  system  is 
modified  to  avoid  or  recover  from  the  heavily  backloggr.4  state. 

'Using  a  small  value  of  p  helps  postpone  the  onset  of  the 


catastrophic  behavior  above,  and  for  small  p,  <4. 2)  can  be  well 
approximated  by 

Dk  *  X  -  <X+pk>e_<X+pk>  <4.3 

Fig.  4.1  illustrates  this  equation.  For  X  >  e“*,  we  see  that  Dk 
>  0  for  all  k.  For  X  <  e~*,  there  is  a  range  of  k  for  which 

Dk  <0,  and  the  size  of  this  range  increases  as  X  decreases  and 

as  p  decreases.  Unfortunately,  X  is  the  arrival  rate  which  we 
would  rather  not  decrease,  and  small  p  means  large  delay  between 
retrials  of  a  collided  packet. 

This  tradeoff  in  p  is  very  undesirable;  large  p  makes  it 
very  easy  to  enter  the  unstable  heavily  backlogged  region, 
whereas  small  p  causes  large  delay  for  collided  packets  in  the 
stable  region.  The  engineering  solution  is  almost  obvious — 
change  p  as  the  backlog  k  changes.  Ideally,  we  would  like  to 

adjust  p  to  keep  X  +  pk  =  1,  thus  maintaining  a  throughput  of  e“ 

for  all  k  >0.  This  keeps  delay  small  when  the  backlog  is  small 
and  keeps  the  system  stable  if  X  <  e_1.  The  problem  with  this 
solution  is  that  k  is  unknown,  and  either  k  must  be  estimated 
from  the  feedback  or  an  apropriate  value  of  p  must  be  estimated. 
Hajek  and  VanLoon  Z291  have  analyzed  a  class  of  algorithms  in 
which  p  is  updated  at  each  slot  simply  as  a  function  of  the 
previous  p  and  the  feedback  information.  They  showed  that  such 
functions  cam  be  chosen  for  any  X  <  e-1  so  as  to  make  the 
resulting  system  stable. 

From  <4. 3),  we  see  that  Dk  is  positive  whenever  X  >  1/e. 
This  is  only  an  approximat ion  of  (4.2),  but  the  approximat ion  is 


good  when  p  is  sma 11,  and  p  must  be  sma 1 1  when  k  i  s  1 arge  t o 
minimize  D ^ •  Thus ,  for  X  >  1/e,  D^,  is  positive  for  all 
sufficiently  large  k  no  matter  how  p  is  chosen,  so  that  slotted 
Aloha  is  unstable  in  this  case  even  if  k  is  known. 

In  the  next  subsection  we  show  that  much  higher  throughputs, 
and  presumably  smaller  delays,  are  possible  when  newly  arriving 
packets  are  sometimes  held  up  and  collisions  are  resolved  in  more 
sophisticated  ways.  The  primary  advantage  that  slotted  Aloha  has 
over  these  more  sophisticated  strategies  is  that  slotted  Aloha 
does  not  require  all  the  feedback  information  we  have  assumed. 

For  many  physical  multiaccess  channels,  particularly  dispersive 
fading  channels,  it  is  difficult  to  distinguish  an  idle  slot  from 
a  collision  with  high  reliability.  It  is  usually 
straight  forward,  through  use  of  a  cyclic  redundancy  check,  to 
distinguish  a  successful  transmission  from  idle  or  collision,  and 
it  can  be  seen  that  this  kind  of  feedback  is  sufficient  for 
slotted  Aloha  but  not  sufficient  for  the  more  sophisticated 
strategies.  Unfortunately  it  is  much  more  difficult  to  estimate 
the  backlog  with  this  type  of  feedback  and  it  is  an  open  research 
problem  to  determine  whether  slotted  Aloha  can  be  stabilized  in 
this  case. 

Pure  Aloha  Cl  3  was  the  precursor  of  slotted  Aloha  and  avoids 
our  assumption  of  a  slotted  system,  although  we  continue  to 
assume  that  each  packet  requires  one  time  unit  for  transmission, 
that  overlapping  packets  collide,  and  that  assumptions  c>,  d>, 
and  e)  hold.  Each  newly  arrived  packet  is  transmitted 
immediately  upon  arrival  and  backlogged  packets  are  transmitted 


after  a  geometrically  distributed  delay.  The  probability  of 
collision  is  higher  here  than  in  a  slotted  system;  a  packet- 
starting  transmission  at  time  t  will  collide  with  other  packets 
starting  anywhere  in  the  internal  <t-l,t+l>.  The  upper  bound  on 
throughput  becomes  <2e>“*  and  the  sa-ne  kinds  of  stability  issues 
arise  as  for  the  slotted  system.  A  major  practical  advantage  of 
pure  Aloha*  however,  is  its  ability  to  handle  packets  of 
different  lengths  C  30 , 3 1 1 . 

4.2  SPLITTING  ALGORITHMS*  In  our  discussion  of  slotted  Aloha,  we 
saw  that  the  throughput  is  upper  bounded  by  1/e  regardless  of  the 
strategy  used  to  adjust  the  retransmission  probability  of 
collided  packets.  This  bound  was  imposed  by  the  restriction  that 
new  arrivals  were  always  transmitted  in  the  next  slot  after  their 
arrival  and  that  backlogged  packets  depended  upon  a  single 
parameter  p  for  retransmission.  To  get  an  intuitive  idea  of  why 
the  transmission  of  new  arrivals  should  sometimes  be  postponed, 
consider  a  slot  in  which  two  packets  collide.  If  the  new 
arrivals  were  held  up  until  the  collision  were  resolved,  then  a 
reasonable  strategy  would  be  for  each  colliding  packet  to 
retransmit  in  the  following  slot  with  probability  1/2.  With 
probability  1/2,  then,  a  successful  transmission  occurs  and  the 
other  packet  would  be  transmitted  in  the  following  slot. 
Alternatively,  with  probability  1/2,  another  collision  or  an  idle 
slot  ensues,  wasting  one  slot.  Again,  in  this  case,  each  packet 
would  be  transmitted  in  the  following  slot  independently  with 
probability  1/2,  and  so  forth  until  the  two  packets  are 
successfully  transmitted.  The  expected  number  of  slots  required 


to  successfully  transmit  the  two  packets  is  easily  seen  to  be  3* 
which  yields  an  effect i we  throughput  of  2/3  during  the  collision 
r eso 1 ut i on  per i od . 

This  concept  of  probaba 1 ist i cal ly  splitting  the  set  of 
packets  involved  in  a  collision  into  a  t r ansrn i 1 1 i ng  set  and  a 
non-transmitt ing  set  while  making  other  packets  wait  is  the 
central  idea  of  a  variety  of  collision  resolution  algorithms  that 
achieve  throughputs  larger  than  1/e  while  using  assumptions  a>  to 
e>;  we  call  these  algorithms  splitting  algorithms.  These 
algorithms  differ  in  the  rules  used  for  splitting  the  collision 
set  (which  might  involve  more  than  two  packets)  and  in  the  rules 
for  allowing  waiting  packets  not  involved  in  a  collision  to 
transmit  after  the  collision  is  resolved. 

The  first  splitting  algorithms  were  the  tree  algorithms 
developed  by  Capet anak is  C323,  Hayes  L333 ,  and  Tsybakov  and 
Mikhailov  C343.  In  these  algorithms*  the  system  alternates 
between  two  modes — normal  mode  and  collision  resolution  mode. 

When  a  collision  occurs  in  normal  mode*  all  transmitters  go  into 
collision  resolution  mode*  all  new  arrivals  wait  until  the  next 
transition  into  normal  mode*  and  all  packets  involved  in  the 
collision  independent ly  select  one  of  two  subsets  with  equal 
probability.  We  view  each  subset  as  corresponding  to  a  branch 
from  the  root  of  a  rooted  binary  tree  (see  fig.  4.2>.  In  the 
slot  following  the  collision*  the  first  of  these  subsets  is 
transmitted.  If  another  collision  occurs*  this  subset  is 
further  split  into  two  smaller  subsets*  corresponding  to  further 
branches  growing  from  the  original  branch.  The  first  of  these 


subsets  is  transmitted  in  the  the  next  slot,  and  if  this 
transmission  is  successful  or  idle,  the  second  of  the  subsets  is 
transmitted  in  the  following  slot.  In  general,  whenever  the 
transmission  of  a  subset  results  in  a  collision,  the  subset  is 
split  and  two  new  branches  of  the  tree  are  grown  from  the  old 
branch.  Whenever  the  transmission  of  a  subset  is  idle  or 
successful  Ci.e.  the  subset  is  empty  or  contains  one  packet >,  the 
next  slot  is  used  to  transmit  the  next  subset.  When  all  subsets 
have  been  exhausted,  the  normal  mode  is  again  entered. 

It  should  be  apparent  that  if  this  algorithm  spends  many 
slots  resolving  a  collision,  then  typically  many  new  arrivals 
will  eagerly  be  awaiting  the  return  to  normal  mode  and  a 
resounding  collision  will  ensue.  What  is  even  worse  is  that  many 
successive  collisions  will  follow  until  the  expected  number  of 
pakets  in  a  subset  becomes  on  the  order  of  1.  Thus  the  algorithm 
can  be  improved  by  eliminating  the  normal  mode;  at  the  end  of  a 
collision  resolution  period,  a  new  collision  resolution  period  is 
immediately  entered  and  each  waiting  packet  randomly  joins  one  of 
k  subsets.  The  number  k  is  chosen  as  a  function  of  the  length  of 
the  preceding  collision  resolution  period  so  that  the  expected 
number  of  packets  per  subset  is  slightly  more  than  one.  Thus  the 
corresponding  tree  has  k  branches  rising  from  the  root  and  two 
branches  rising  from  each  non- leaf  node. 


Capet ana kis  C323  showed  that  this  algorithm  has  a  maximum 
throughput  of  ©.43  and  is  stable  for  all  input  rates  less  than 
0,43.  The  maximum  throughput  attainable  with  tree  algorithms  was 
later  increased  to  0.46  due  to  a  simple  improvement  first 
suggested  by  Massey  C353.  Mote  what  the  algorithm  does  when  the 
set  i nvo 1 ved  in  a  co 1 1 i s i on  is  split  into  t wo  subset s  of  wh i ch 
the  first  is  empty.  The  first  slot  following  the  collision  is 
then  idle  and  the  next  is  a  collision  involving  all  the  packets 
in  the  first  collision.  Massey’s  improvement  was  to  avoid  this 
predictable  colli son  by  resplitting  the  second  subset  of  a 
collision  set  whenever  the  first  subset  is  empty. 

The  next  improvement  in  throughput  was  due  to  Gal  lager  C  36 1 , 
and  somewhat  later  with  a  more  complete  analysis,  by  Tsybakov  and 
Mikhailov  C373J  this  involved  eliminating  the  tree  structure 
entirely.  We  shall  describe  this  algorithm  precisely  later, 
since  it  is  considerably  easier  to  analyze  than  the  tree 
algorithm.  First,  however,  we  view  it  as  another  modification  of 
the  tree  algorithm.  With  a  little  thought,  one  can  see  that  the 
number  of  packets  in  a  subset  that  has  had  a  collision  is  a 
Poisson  random  variable  conditional  on  the  number  being  2  or 
more.  If  the  packets  in  this  set  are  randomly  divided  into  two 
subsets,  then  it  can  also  be  seen  that  if  the  first  subset 
contains  2  or  more  packets  <i.e.  another  collision)  then, 
conditional  on  this,  the  number  of  packets  in  the  second  subset 
is  Poisson.  Thus,  as  far  as  the  algorithm  is  concerned,  this 
subset  is  statistically  identical  to  some  time  interval  of  new 


arrivals,  and  the  algorithm  would  be  improved  if*  rather  than 

wasting  a  slot  on  this  subset,  we  simply  treated  it  like  waiting 

new  arrivals.  We  will  get  to  the  bookkeeping  issue  of  how  to  do 

this  shortly,  but  note  that  if  we  eliminate  the  second  subset  as 

a  separate  entity  every  time  the  first  subset  is  divided,  then  we 
never  have  more  subsets  to  consider  than  we  started  with. 

The  easiest  way  to  do  the  bookkeeping  concerning  subsets  and 
waiting  packets  is  by  means  of  the  arrival  times  of  the  packets. 
If  all  the  packets  that  arrived  in  a  given  time  interval  are 
transmitted  in  a  slot  and  a  collision  results,  then  the  interval 
is  split  into  two  equal  sub  intervals  and  the  packets  in  the  first- 
sub  interval  are  regarded  as  the  first  subset  and  those  in  the 
second  as  the  second  subset.  With  this  approach,  packets  are 
always  sent  in  a  first  come  first  served  <FCFS>  order,  so  we  call 
this  a  FCFS  splitting  algorithm. 

We  now  express  the  algorithm  precisely.  Suppose  that  at 
integer  time  t  the  algorithm  has  successfully  transmitted  all 
packets  that  arrived  before  some  time  T  (not  necessarily 
integer).  In  the  slot  Ct,t+1),  all  the  packets  that  arrived 

between  T  and  T+u  are  transmitted.  The  parameter  u  is  determined 

\ 

by  all  each  transmitter  based  on  the  history  of  the  feedback  up 
to  time  t.  The  transmitters  also  calculate  T  based  on  the 
feedback  history.  It  is  helpful  to  view  the  packet  arrivals  in 
CT,t>  as  being  in  a  distributed  queue  (see  fig.  4.3>.  We  would 
like  to  allocate  the  queued  packets  one  at  a  time  starting  at  the 
front  of  the  queue,  but  the  individual  arrival  times  are  unknown 
except  that  each  transmitter  containing  a  packet  knows  that 
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packet's  arrival  time.  Thus  the  algorithm  attempts  to  allocate 
an  interval  u  at  the  front  of  the  queue  for  the  next  slot  so  as 
to  transmit  the  waiting  packets  as  quickly  as  possible.  Note 
that  maximizing  the  probability  of  success  in  the  next  slot  is 
not  the  best  thing  to  do  since.,  as  we  have  seen,  a  collision  in 
the  next  slot  allows  a  higher  throughput  in  the  succeeding  few 
slots  than  is  possible  with  an  idle  slot  or  successful  slot. 

The  algorithm  given  below  determines  the  allocation  interval 
gi<t>  and  head  of  queue  time  T<t>  for  the  slot  Ct,t  +  1>  in  terms  of 
the  allocation  interval  u<t-l>,  head  of  queue  T<t-1>,  and  the 
feedback  <0.1 ,  c)  for  the  slot  Ct— l»t>.  There  is  also  a  binary 
state  Q<t>sC1.2>  which  is  a  function  of  Q<t-1>  and  the  feedback 
for  C  t  - 1 ,  t  >  .  The  state  Q<t-1>  also  enters  into  the  determination 
of  u.<t>  and  T<t>.  Q<t>  is  set  to  2  if  the  interval  used  in  slot 
Ct-l.t)  has  been  divided  by  2  for  slot  Ct.t+1>  and  is  1 
otherwise.  Thus  GKt>  is  the  number  of  subsets  currently  under 
cons i derat  ion.  The  algorithm  also  has  a  parameter  that 
determines  the  size  of  allocation  interval  to  be  used  after  a 
collision  resolution  period  is  completed.  For  maximum 
throughput,  u0  turns  out  to  be  2.6.  Note  that  the  allocation 
interval  is  also  limited  by  t-T<t>,  the  interval  of  arrival  times 
that  are  still  waiting  for  transmission. 


if  feedback  =  c  then 

T  <  t  >  =  T <  t  - 1  > ;  q  <  t  >  =  2 ; 

U  v  t  >  =  U  <  t  - 1  >  /2  <4.4  > 

if  feedback  =  S  or  1  and  GKt-l>  =  1  then 
T < t  >  =  T<t-i >+u<t-l > ;  Q<t>  =  15 

M<t>  =  minCngj t-TCt > D  <4.5  > 

if  feedback  =  1  and  Q<t— 1>  -  2  then 
T<t  >  =  T  <  t - 1 >  +u  <  t - 1 >  5  Q<t>  *  15 

H<t>  «  u  <  t  - 1  >  <4.6  > 

i f  feedbac k  =  ©  and  Q  <  t  - 1  >  =  2  t hen 
T<t  >  =  T < t - 1  >  +u <  t  - 1  >  J  GKt )  =  25 

U<t>  =  u<t-l>/2  <4.7  > 

In  case  of  a  collision  in  slot  Ct-l*t>,  Eq.  <4. 4)  splits  the 
allocation  interval  CT<t-l>,  T<t-1 >+w<t-l > >  into  two  intervals 
[T<t-1>,  T <t  — 1 >+u<t  — 1 >/2>  and  CT<t-l >+w<t-l >/2,  T<t-1 >+u<t-l > > . 
Q<t >  =  2  allows  the  algorithm  to  “remember"  the  existence  of 
these  two  sub intervals.  If  there  was  a  previous  sub interval  from 
CT<t-l>+u<t-l>>  T  <  t  - 1 )  +2u < t - 1  > > »  the  al gor i t hm  *' forget s ”  about  i t 
at  this  point*  regarding  that  sub interval  as  part  of  the  waiting 
queue.  Given  two  or  more  packets  in  CT<t-l>ji  T<t-1  >+2u<t-l  >  >  and 
two  or  more  packets  in  CT<t-l>,  T<t-1 >+u<t-l > ) ,  the  number  of 
packets  in  CTCt-O+uXt-l  > »  T<t-1  >+2iKt-l  >  >  is  easily  seen  to  be 
Poisson  with  parameter  \w<t-l>. 

Eg.  <4.5>  corresponds  to  the  end  of  a  collision  resolution 
period  or  a  subsequent  period  with  no  collisions  and  simply  moves 
the  head  of  the  queue  and  allocates  a  new  interval.  Eq.  <4.6? 
corresponds  to  a  successful  transmission  of  the  first  sub interval 
from  a  previous  collision  and  movement  to  the  second  sub interval. 
Finally  <4.7>  corresponds  to  Massey's  improvement  on  the  tree 


a  1  go r  i  t  hm  when  a  co  1 1  i  s  i  on  f o  1 1  owed  by  an  idle  (  or  perhaps  sevea  1 
idlest  is  followed  by  splitting  the  second  sub interval. 

The  FCFS  splitting  algorithm  can  be  analyzed  as  a 

homogeneous  Markov  chain,  using  Q(t>,  u ( t  >  and  t-N(t>  as  the 

state  for  integer  values  of  t.  It  is  simpler,  however,  to 

analyze  a  single  collision  resolution  period,  starting  at  a  t  for 

which  u(t >  =  u0  and  Q(t>  =  1  and  ending  immediately  before  the 

next  t  for  which  u<t>  =  min(u0, N(t >-t >  and  Q<t>  =  1.  The 

resulting  Markov  chain  is  then  independent  of  N<t>-t  ( aside  from 

the  initial  assumption  that  N(t>— t  2  y.0>  and  allows  us  in 

principle  to  find  the  distribution  of  the  number  of  slots  and 

number  of  successful  transmissions  in  a  collision  resolution 

period.  Note  that  in  each  update  of  u  (aside  from  the  beginning 

of  the  collision  resolution  per iod),  u  either  stays  the  same  or 

is  divided  by  2,  so  that  u  *  2-iu0  in  all  cases  for  some  integer 

i  >0.  Thus  each  state  of  the  chain  can  be  represented  as  Sj,* 

where  j  —  Q<t)sCl,2>  and  i  is  such  that  u(t>  =  2-iu0.  In  state 

®2,  i  *•*>!*>  the  only  possible  transitions  are  to  S2,i+1  if  an 

idle  or  collision  occurs  or  to  S,  *  if  a  success  occurs.  From 

i » i 

i  2  0,  the  only  possible  transitions  are  to  §2, if  a 
collision  occurs  or  to  S1>0  (represent ing  the  end  of  the  per i od > 
otherwise  (see  fig.  4.4>. 

011  that  remains  to  complete  the  chain  is  to  calculate  the 
transition  probabi  1  it ies,  ^2,1  *or  a  ion  from  $2,i  ®l,i 

and  for  a  transition  from  to  Sjj0.  In  state  %2,i’  we 

have  two  sub intervals  each  of  size  =  u02~ 1 .  The  number  of 
packets  in  each  sub interval  is  a  Poisson  random  variable  with 


parameter  Xuj  conditional  on  the  sum  of  the  number  of  packets  in 
the  two  sub intervals  being  two  or  more.  The  transition  to 
occurs  if  the  first  sub interval  contains  exactly  one  packet  <i.e. 
the  transmission  of  the  first  sub interval  is  successful > .  The 
probability  of  this  is  then 


->..U .  -Xu  ■ 
Xuj  ©  1 C 1-e 


2,  i 


— 2Xu 


5  i  >  1 


<4.  S> 


1  - 


1<l+2Xui> 


In  state  Sj^,  i  't  1,  we  are  about  to  transmit  the  second  of 


two  sub intervals  each  of  size  Uj.  The  number  of  packets  in  each 


subinterval  is  Poisson  with  parameter  Xu*  conditional  both  on  the 
sum  being  two  or  more  and  the  first  interval  containing  exactly 
one  packet.  This  means  that  the  number  of  packets  in  the  second 
sub interval  is  Poisson  conditional  on  one  or  more  packets  in  the 
second  sub  i  nt  erva  1 .  The  probab  ility  of  a  t  r  ans  i  t  i  on  to  is 

then  the  probability  of  exactly  one  packet,  so 


-2u 

Xuj* 


i 


;  i  >  l 


<4.9> 


Finally  the  probability  of  a  direct  transition  from  to  S1(0 

is 


-Xu 


1,0 


<1  + 


Xu0>< 


0 


<4. 10> 


It  is  now  straight  forward,  by  computer  iteration,  to  find 
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the  probability  that  each  state  is  entered?  starting  at  Slj0, 
before  the  first  return  to  SjjP1.  Noting  that  the  successful 
transmissions  correspond  to  the  transitions  from  S-.  ,  to  Si  4  for 
each  i  >  1  ,  transitions  from  to  S1>Cr.,  for  i  >  1,  and 

successful  transmissions  directly  from  S  j  ^  0  to  S1>0,  we  can 
calculate  the  expected  number  of  successful  transmissions  and  the 
expected  number  of  slots  per  collision  resolution  interval.  It 
turns  out  that  the  ratio  of  these  two  expected  values  is  less 
than  X  for  all  X  <  0.4871. 

It  can  be  seen  from  <  4 . 9  >  and  <4. 10>  that  the  probability  of 
reaching  S2 , j  ,  given  that  has  been  reached,  tends  to  1/2 

as  i  increases.  Thus  the  number  of  slots  and  the  number  of 
successful  transmissions  in  a  collision  resolution  interval  both 
have  moment  generating  functions.  From  this,  we  can  see  that  for 
any  starting  value  of  t-T<t>  and  any  X  <  0.4871,  the  number  of 
slots  required  to  reach  the  end  of  a  collision  resolution 
interval  where  t-T<t>  <  u.0  also  has  a  moment  generating  function 
and  thus  the  algorithm  is  stable  for  X  <  0.4871. 

The  expected  delay  for  this  algorithm  is  considerably  harder 
to  analyze  than  the  maximum  throughput.  Tsybakov  and  Likhanov 
C383  have  found  an  upper  bound  on  delay  and  more  recently  Huang 
and  Berger  C393  have  constructed  tight  upper  and  lower  bounds  as 
well  as  simulation  results.  The  expected  delay  is  about  5  1/2 
slots  at  X  =  1/e  and  about  16  slots  at  X  =  0.46. 

The  FCFS  splitting  algorithm  can  be  improved  somewhat  if  the 
intervals  are  split  in  an  optimal  way  after  collisions.  Because 
of  the  possibility  of  more  than  two  packets  in  a  collision,  equal 
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sub  intervals  are  not  quite  optimal.  Mosel  y  and  Hurnblet  E48  3  and 
Tsybakov  and  Mikhaslov  C 37 1  show  that  choosing  the  optimum 
sub intervals  increases  the  maximum  throughput  to  8.4878. 

Recently  another  improvement  of  3.6  x  18  '  has  been  made  by 
Vvedenskaya  and  Pinsker  C413.  Although  this  gain  is  small,  it  is 
of  theoretical  interest  since  it  departs  from  the  principal  of 
always  resolving  one  collision  before  trying  any  new  intervals. 

Considerable  effort  has  been  spent  on  finding  upper  bounds 
to  the  maximum  throughput  that  can  be  achieved  using  the 
assumptions  a>  to  e>  C42,  43,  44,  45,  463.  The  tightest  bound 
known  is  8.537  an  is  due  to  Mikhailov  and  Tsybakov  [46 3. 
Pippenger's  result  C423  is  also  of  particular  interest  since  he 
shows  that  if  the  amount  of  feedback  is  increased  to  give  the 
number  of  packets  involved  in  each  collision,  then  any  throughput 
up  to  one  may  be  achieved. 

One  negative  aspect  of  FCFS  splitting  algorithms  <and  also 
Massey's  improvement  on  the  Tree  algorithms}  is  their 
susceptibility  to  noisy  feedback.  If  an  idle  slot  is  mistakenly 
fed  back  to  the  transmitters  as  a  collision,  then  the  algorithm 
as  stated  will  forever  continue  to  split  a  smaller  and  smaller 
second  sub interval.  This  problem  could  be  solved,  of  course,  by 
only  splitting  a  given  number  of  times  in  a  row  on  receipt  of  8 
feedback  and  then  trying  the  entire  interval.  The  general 
subject  of  noisy  feedback  is  still  not  well  understood,  but  a 
number  of  partial  results  are  known  C35,  47,  483.  The  review 
paper  by  Tsybakov  C433  also  reviews  many  variations  on  collision 
resolution  algorithms  for  a  variety  of  assumptions. 
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One  interesting  approach  to  noisy  feedback  and  other 
Mar i at  ions  from  the  ideal  model  above  is  that  of  developing 
algorithms  that  are  as  simple  and  robust  as  possible.  Mat-hys  and 
FI a. jo let  C493  have  recently  developed  an  algorithm  with  an 
attractive  tradeoff  between  simplicity  and  throughput.  Newly 
arriving  packets  are  always  transmitted  in. the  slot  after  their 
arrival,  and  backlogged  packets  use  a  ternary  tree  algorithm. 
Massey's  improvement  on  the  tree  algorithm  is  not  used,  thus 
avoiding  the  above  deadlock  problems  with  noisy  feedback; .  The 
resulting  maximum  stable  throughput  is  9.40.  It  is  rather 
surprising  that  a  throughput  greater  than  1/e  is  possible  while 
always  allowing  new  arrivals  to  transmit  in  the  next  slot. 

For  multiaccess  systems  with  a  finite  number  of  users,  it  is 
also  of  interest  to  modify  these  splitting  algorithms  so  as  to 
take  advantage  of  the  finite  number  of  transmitters  and  to  make  a 
graceful  transition  from  collision  resolution  to  TDMA  as  the 
arrival  rate  increases.  Specific  approaches  to  this  are 
discussed  in  150,513.  The  approach  in  E51D  is  also  of  interest 
because  of  drawing  a  parallel  between  splitting  algorithms  and 
group  testing,  as  developed  in  the  statistics  community  in  the 


40's  and  50's 


4.3  CARRIER  SENSING:  We  now  want,  to  chanae  the  basic 
assumptions  a)  to  e)  .  Note  that  in  many  mui  hi acces  :::  sv stems 
such  as  local  networks,  the  transmi liters  can  hear  whether 
the  other  transmitters  are  sendinq  anything.  In  such  a 
situtation,  it  makes  sense  to  give  up  the  strict  slotting 
specified  in  assumption  a),  and  assume  instead  that  a 
transmitter  can  start  to  send  a  packet  in  the  middle  of  a 
data  slot  if  no  other  transmitters  are  currently  sending. 
This  change  is  far  more  important  than  simply  allowing  idle 
slots  to  be  used  more  efficiently,  since  now  packets  car- 
start  at  different  mini  slot  times,  thus  avoiding  many 
col lisions. 

Let  «  be  the  time  required  for  all  sources  to  determine 
that  nothing  is  being  transmitted;  ie.  cc  is  the  sum  of  the 
maximum  propagation  delay  between  sources  and  the  time 
required  by  a  receiver  to  reliably  distinguish  between 
signal  and  no  signal.  Assume  that  c<  time  units  after  the 
beginning  of  a  slot,  if  nothing  is  being  transmitted  in  that 
slot,  then  the  slot  terminates  and  a  new  slot  begins.  Thus 
idle  slots  (sometimes  called  minislots)  last  for  cc  time 
units  and  slots  with  one  or  more  packets  last  for  1  time 
unit  as  before.  We  still  assume  that  all  packets  require 
one  time  unit  for  transmission,  that  feedback  is 
instantaneous  at  the  end  of  a  slot,  that  arrivals  are 
Poisson  with  intensity  a,  and  that  there  are  effectively  an 
infinite  number  of  sources.  We  first  modify  slotted  Aloha 
for  this  new  situation  and  then  modify  the  FCFS  splitting 


algorithm.  These  techniques  are  called  carrier  sense 
multiple  access  <CSMA> .  although  they  do  not  imply  the  use 
jt  a  carrier  but  simply  the  ability  to  quick! y  racoon i^a  the 

use  of  the  channel  by  another  transmitter. 

We  can  model  this  situation  in  almost  the  same  way  as 
before.  The  only  difference  is  that  idle  slots  now  last  for 
a  duration  «,  whereas  successful  and  collision  slots  each 
last  for  one  unit  of  time.  For  slotted  Aloha,  if  a  new 
packet  arrives  at  a  transmitter  when  an  idle  mini  slot  is  in 
progress,  the  packet  begins  transmission  at  the  end  of  that, 
minislot  (thus  turning  the  next  slot  into  a  full  slot).  If 
a  transmission  is  in  progress,  the  packet  is  regarded  as  a 
backlogged  packet  and  begins  transmission  with  some  given 
probability  p  after  earn  idle  minislot.  This  technique  was 
called  non-persistent  CSMA  in  the  original  description  C523; 
in  an  inferior,  persistent,  variaton,  all  transmission 
attempts  during  a  busy  slot  would  simply  be  transmitted  at 
the  end  of  that  slot,  thus  causing  a  collision  with  a  rather 
high  probability.  We  ignore  this  alternative  form  in  what 
f  ol  1  ows. 

To  analyze  CSMA,  we  can  use  a  Markov  Chain  again,  using 
the  number  of  backlogged  packets  as  the  state  and  the  ends 
of  minislots  as  the  state  transition  times,  Rather  than 
write  out  the  state  transition  equations,  which  are  not 
particularly  insightful,  we  simply  modify  the  drift  in  (4.2) 
for  this  new  model.  The  expected  number  of  arrivals  in  the 


minislot  befors  the  transition  is  *oc,  and  with  probability 


a  1-p)  \  this  minislot  is  -followed  by  a  full  slot  with  a 
expected  arrivals.  Mote  that  there  is  always  an  unused 
mini  slot  at  the  end  of  each  full  slot,  but  the  arrivals  in 

that  mini  slot  are  considered  as  part  r>f  the  following 
transition.  The  model  could  be  changed  to  eliminate  this 
wasted  minislot,  but  the  difference  is  negligible  for  small 
«.  The  expected  number  of  departures  per  state  transition 
is  simply  the  probability  of  a  success. Thus 

Dk  =  A-:c  +  a  E  1  <  1  -p  >  k  3  -  CAoc+pk/  (1-p)  He-*®  ( l-p)k 

(4.11) 

This  is  minimized  over  p  at 

. _  _  1  -  a  <  1  +cc ) 

P  3  - 

k  -  a  < l+a)  (4. 12) 


The  stability  issues  with  CSMA  slotted  Aloha  are  almost 
the  same  as  with  ordinary  slotted  Aloha.  One  can  control  p 
by  monitoring  the  feedback,  or  one  can  simply  operate  at  a 
small  value  of  a  and  p  and  hope  that  the  backlog  never 
becomes  too  large.  If  we  use  the  optimal  value  of  p  for  each 
k,  and  substitute  this  in  (4.11),  we  find  that  Dk  is 
negative  for  all  k  so  long  as 

>IM  4  (4.  13) 

By  expanding  this  in  a  power  series  for  small  cc,  we 


find  that  the  system  is  scable  for  all  a  less  than  f2cc 
optimal  value  of  p  then  satisfies  pk  &  / 2a.  It  is 


The 


interesting  to  observe  that  this  optimal  point  occurs  where 
the  time  spent  on  idle  mini  slots  is  approximately  equal  to 
that  spent  on  collisions;  naturally  there  are  many  mere 
idle  slots  than  collisions,  but  idle  slots  have  a  much 
shorter  duration.  Delays  also  tend  to  be  much  smaller  in  a 
CSMA  system  since  backlogged  packets  get  a  trans.  .ssion 
opportunity  every  minislot,  and,  although  the  probability  of 
transmitting  in  a  minislot  decreases  with  •( or ,  the 
probability  o-f  transmitting  per  unit  time  increases  as  1  /  i  o: . 

Next  consider  CSMA  with  pure  Aloha.  We  will  not 
analyze  this  in  detail,  but  note  that  with  the  same  carrier 
sensing  time  oc  and  the  same  transmission  probability  p,  the 
probability  o-f  collision  increases  by  a  -factor  o-f  2.  This 
means  that  p  should  be  decreased  by  a  factor  of  (2  for 
maximum  throughput,  and  thus  the  unslotted  system  has  a 
maximum  throughput  of  l-2foT  for  small  oc.  We  see  that  the 
difference  between  pure  and  slotted  Aloha  for  CSMA  is  quite 
small  for  small  a,  and  the  synchronization  required  for 
slotting  with  CSMA  is  somewhat  trickier  than  that  for 
ordinary  slotted  Aloha.  Thus  pure  Aloha  appears  to  be  the 
natural  choice  with  CSMA. 

Finally  consider  the  FCFS  splitting  algorithm  modified 
for  CSMA.  The  same  algorithm  as  in  <4. 4)  to  <4. 7)  can  be 
used,  although  the  parameter  pC)  should  be  chanaed,  and  as  we 
shall  see  shortly,  intervals  with  collisions  should  not  be 
split  into  equal  subintervals.  Since  collisions  waste  much 
more  time  than  idle  minislots,  the  basic  allocation  interval 


Fy,  should  be  chosen  very  small.  This  means  in  turn  that, 
collisions  with  more  than  two  packets  are  negligible  in  the 
analysis  of  the  algorithm,  and  thus  the  analysis  is  much 
simpler  than  before. 

As  before  we  find  the  expected  time  and  the  expected 
number  of  successes  in  a  collision  resolution  period, 
including  a  single  idle  or  successful  slot  as  a  degenerate 
case  of  a  collision  resolution  period.  Let  y  =  ah-.-..  With 
probability  e  %  an  original  allocation  interval  is  empty, 
yielding  a  collision  resolution  time  of  cc  with  no  successes. 
With  probability  ye  '¥ ,  there  is  an  initial  success,  yielding 
collision  resolution  time  1+oc  (as  before,  we  include  an 
empty  minislot  at  the  end  of  each  full  slot).  Finally,  with 
probability  (ya/2)e  ¥,  there  is  a  collision  yielding  a 
collision  resolution  time  of  1+T,  for  some  T  to  be 
calculated  later,  and  two  successes.  Thus, 

E (time/period)  at  «e~’1'  +  y <  1+a) e~'¥  +  ( 1+T>  (y^/2) e~’¥ 

(4. 14) 


E (packets/period)  as  ye-**’  +  2 (ya/2) e_v 


(4. 15) 


Note  that  we  have  used  the  approximation  that  only  two 
packets  occur  in  collisions  here.  As  before,  the  maximum 
throughput  that  can  be  achieved  is  the  ratio  of  (4.15)  to 


(4.  14)  , 


a 


maw 


at  (y  +  y*)/c«  +  y(l+oe)  +  (ya/2)  < 1+T) D 


(4. 16) 


4.25 


We  can  now  maximize  the  right  hand  side  of  (4.1.6)  over 
iie.  over  »V  ’  1  n  the  limit  of  small  oc,  we  get  the 

asymptotic  expressions 

21  |2«/  (T-r>  (4.17) 

a  ..  -  1  -  ■( 2c<  <T-iT  (4.  18) 

max 

Finally  we  must  calculate  T,  the  time  to  resolve  a 
collision  after  it  has  occurred.  Let  x  be  the  fraction  of 
an  interval  used  in  the  first  subset  when  an  interval  is 
split.  T  includes  the  time  a  for  the  idle  minislot  that 
always  follows  a  collision.  If  the  next  minislot  is  idle,  cc 
is  the  duration  of  the  mini  slot,  and  T-oc  is  the  expected 
time  still  remaining  to  resolve  the  collision.  Similarly, 
if  another  collision  occurs,  1+T  is  the  expected  time  for 
resolution.  Finally,  if  a  successful  transmission  occurs, 
2<l+oc)  is  the  required  time  for  resolution.  Thus 

T  at  a  +  <1-x)2T  +  x2<l+T)  +  4x(l-x)<l+«)  (4.19) 

T  is  minimized  by  x  »  'jcc+a*  -  a,  and  the  resulting 
value  of  T,  for  small  oc,  is  T  *  2+'|"cT.  Substituting  this  in 
(4. 18) ,  we  see  that 

Vox  2  1  -  <4-20> 

For  small  «,  then*  the  FCFS  splitting  algorithm  has  the 


same  maximum  throughput  as  slotted  Aloha.  This  is  not 
surprising,  since  without  CSMA,  the  major  advantaqe  of  the 
FCFS  algorithm  is  its  efficiency  in  resolving  collisions, 
and  with  CSMA,  collisions  rarely  occur.  It  is  somewhat 
surprising  at  first  that  if  we  use  the  FCFS  algorithm  with 
equal  subintervals  <ie.  x=l/2>,  then  we  are  limited  to  a 
throughput  of  l->|3c<.  This  degradation  is  due  to  a 
substantial  increase  in  the  number  of  collisions. 

The  same  type  of  analysis  as  used  here  can  be  used  for 
reservation  multiaccess  systems  and  a  variety  of  other 
conditions.  The  idea,  originally  due  to  Humblet  C533  is  to 
generalize  our  original  assumptions  a)  to  e)  to  allow  the 
durations  of  idle,  success,  or  collision  slots  to  all  be 
different.  Recoil  that  in  CSMA,  idle  slots  had  duration  a 
and  success  and  collision  slots  had  duration  l+a.  In  a 
reservation  system,  idle  and  collision  slots  would  have  the 
duration  required  to  send  a  reservation  packet,  whereas 
success  slots  would  have  the  duration  required  for  both  a 
reservation  and  a  message  transmission. 
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